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PREFACE 


The  mathematical  and  statistical  software  library  which  is  in  residence 
in  the  Air  Force  Human  Resources  Laboratory  (AFHRL)  Univac  1108  computer 
system  is  maintained  and  undergoes  continued  development  in  support  of  Air 
Force  personnel  and  training  research  with  the  objectives  to  facilitate  and 
to  improve  the  analysis  of  human  resources  data.  The  compilation  of  this 
software  index  is  in  line  with  these  objectives  making  available  an  abridged 
documentation  source  for  the  software  library  which  will  expedite  the  work  of 
Air  Force  personnel  researchers.  Work  on  the  software  index  was  accomplished 
under  project  6323,  Personnel  Data  Analysis;  task  632305,  Development  of 
Analytic  Methodology  for  Air  Force  Personnel  Research  Data. 

Many  people  over  a number  of  years  have  contributed  to  the  development  and 
implementation  of  programs  in  the  AFHRL  mathematical  and  statistical  software 
library.  While  those  responsible  for  the  specific  programs  are  identified  in 
their  formal  program  documentation  reports,  it  is  appropriate  to  acknowledge 
here  the  key  personnel  currently  responsible  for  the  ongoing  development  of 
the  software  library.  They  include  Dr.  Janos  B.  Koplyay,  Mr.  C.  Deene  Gott, 
Mr.  Charles  R.  Rogers,  and  Mr.  Jacob  E.  Ntyer,  Jr.  Special  acknowledgement 
also  goes  to  Mr.  Larry  K.  Whitehead  for  his  programming  efforts  in  program 
development,  implementation,  and  maintenance. 
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1.  INTRODUCTION 


THE  COMPUTATIONAL  SCIENCES  DIVISION  ( SM ) OF  THE  AIR  FORCE 
HUMAN  RESOURCES  LABORATORY  (AFHRL)  IS  RESPONSIBLE  FOR  CONDUCTING 
RESEARCH  AND  DEVELOPMENT  IN  THE  AREAS  OF  STATISTICS,  MATHEMATICS, 
COMPUTER  OPERATIONS,  AND  RESEARCH  DESIGN  IN  SUPPORT  OF  AIR  FORCE 
PERSONNEL  AND  TRAINING  RESEARCH  WITH  THE  OBJECTIVES  OF 
FACILITATING  AND  IMPROVING  THE  ANALYSIS  OF  HUMAN  RESOURCES  DATA. 
AS  A RESULT  OF  THIS  MISSION,  A SIZEABLE  MATHEMATICAL  AND 
STATISTICAL  SOFTWARE  LIBRARY  WHICH  ENCOMPASSES  A BROAD  RANGE  OF 
METHODOLOGY  IS  RESIDENT  IN  THE  AFHRL  UNIVAC  1108  COMPUTER  SYSTEM 
HOUSED  IN  SM.  PAST  SM  EXPERIENCE  WITH  AIR  FORCE  PERSONNEL 
RESEARCHERS  HAS  REVEALED  A NEED  FOR  AN  ABRIDGED  DOCUMENTATION 
SOURCE  ON  THE  SOFTWARE  LIBRARY  WHICH  RESEARCHERS  MAY  QUICKLY  SCAN 
TO  IDENTIFY  THE  MATHEMATICAL  OR  STATISTICAL  SOFTWARE  WHICH  IS 
CURRENTLY  OPERATIONAL  ON  THE  AFHRL  COMPUTER  SYSTEM  AND  IS 
AVAILABLE  TO  THEM  FOR  USE  IN  THEIR  RESEARCH  PROJECTS.  THE  PURPOSE 
OF  THIS  REPORT  IS  TO  FILL  THIS  NEED  BY  PROVIDING  A SOFTWARE  INDEX 
WHICH  CONTAINS  BRIEF  DESCRIPTIONS  OF  GENERAL-PURPOSE  MATHEMATICAL 
AND  STATISTICAL  SOFTWARE  IN  THE  AFHRL  SOFTWARE  LIBRARY. 

THE  SOFTWARE  DESCRIPTIONS  PROVIDED  IN  THIS  REPORT  ARE 
ORGANIZED  INTO  THREE  CHAPTERS  (I.E.,  CHAPTERS  2 THROUGH  M)  . 
CHAPTER  2 DEALS  WITH  THE  "STAND-ALONE"  PROGRAMS  IN  THE  SOFTWARE 
LIBRARY,  WHILE  CHAPTER  3 DESCRIBES  THE  LIBRARY’S  SUBROUTINE 
SYSTEMS.  THE  FINAL  CHAPTER,  CHAPTER  S,  DISCUSSES  THE  NATIONALLY 
RECOGNIZED  STATISTICAL  PACKAGES  WHICH  ARE  ALSO  AVAILABLE  IN  THE 
SOFTWARE  LIBRARY. 

CHAPTER  2 CONTAINS  INFORMATION  ON  27  PROGRAMS,  WITH  THE 

information  for  each  program  being  comprised  of  two  parts:  an 
abstract  and  a list  of  program  documentation  references  for  the 

READER  WHO  WISHES  TO  OBTAIN  THE  FORMAL  DOCUMENTATION  FOR  THE 
PROGRAM.  IN  EACH  CASE,  THE  INFORMATION  INCLUDED  IN  THE  ABSTRACT 
FOR  A PROGRAM  GIVES  A GENERAL  DESCRIPTION  OF  THE  PROGRAMS  THE 
ORIGIN  OF  THE  PROGRAM,  IF  THE  PROGRAM  WAS  OBTAINED  FROM  A SOURCE 
OUTSIDE  OF  AFHRL?  IMPORTANT  PROGRAM  RESTRICTIONS  (E.G.,  MAXIMUM 
NUMBER  OF  VARIABLES  PERMITTED)  • AND  A WARNING  (IF  APPLICABLE) 
ABOUT  EXCESSIVE  COMPUTER  EXECUTION  TIME  WHICH  MIGHT  BE  INCURRED  IN 
CERTAIN  CIRCUMSTANCES.  FOLLOWING  THE  ABSTRACT  IN  A SET  OF 
PARENTHESES  IS  THE  OFFICE  SYMBOL  OF  THE  SM  BRANCH  RESPONSIBLE  FOR 
THE  PROGRAM  (I.E.,  "SMA"  FOR  THE  ANALYSIS  AND  PROGRAMMING  BRANCH, 
"SMO"  FOR  THE  COMPUTER  OPERATIONS  BRANCH,  AND  "SMS"  FOR  THE 
STATISTICAL  AND  COMPUTER  TECHNOLOGY  BRANCH).  THE  DOCUMENTATION 
REFERENCE  FOR  EACH  PROGRAM  IS  USUALLY  IN  THE  FORM  OF  A TECHNICAL 
REPORT  OR  A COMPUTERIZED  DOCUMENTATION  REPORT  PRODUCED  BY  THE 
UNIVAC  1108  DOC  PROCESSOR.  FOR  THE  READER  WITH  ACCESS  TO  THE 
AFHRL  UNIVAC  1108,  THE  DOCUMENTATION  REFERENCE  ALSO  CONTAINS  (IF 
APPLICABLE)  THE  RETRIEVAL  COMMAND  WHICH  CAN  BE  EXECUTED  TO  SECURE 
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A COPY  OF  THE  DOC  PROCESSOR  REPORT.  READERS  WHO  DO  NOT  HAVE 
ACCESS  TO  THE  AFHRL  COMPUTER  AND  WISH  TO  OBTAIN  A COPY  OF  THE  DOC 
PROCESSOR  REPORT  MAY  ADDRESS  THEIR  REQUESTS  TO  AFHRL/SM,  BROOKS 
AIR  FORCE  BASE,  TEXAS  7823S.  (NOTE.  NON-AIR  FORCE  REQUESTERS  WILL 
BE  REQUIRED  TO  SECURE  RELEASE  APPROVAL  FOR  THE  REQUESTED  SOFTWARE 
DOCUMENTATION  FROM  THE  PR06R  AMS  AND  TECHNOLOGY  DIVISION  OF  DATA 
AUTOMATION,  HEADQUARTERS  AIR  FORCE  SYSTEMS  COMMAND  (HQ 
AFSC/ADCT I . ) 

CHAPTER  3 PROVIDES  INFORMATION  ON  THE  SOFTWARE  LIBRARY’S 
THREE  MATHEMATICAL  AND/OR  STATISTICAL  SUBROUTINE  SYSTEMS.  A 
GENERAL  DESCRIPTION  OF  EACH  SUBROUTINE  SYSTEM  Is  GIVEN  ALONG  WITH 
A LIST  OF  THE  SUBROUTINES  CONTAINED  IN  EACH  SYSTEM  AND  A LIST  OF 
DOCUMENTATION  REFERENCES. 

CHAPTER  H OF  THIS  REPORT  IDENTIFIES  THE  TWO  NATIONALLY 
RECOGNIZED  STATISTICAL  PACKAGES  WHICH  ARE  AVAILABLE  IN  THE 
SOFTWARE  LIBRARY.  THE  INFORMATION  PROVIDED  ON  EACH  PACKAGE 
INCLUDES  A GENERAL  DESCRIPTION  OF  THE  PACKAGE,  A LIST  AND  BRIEF 
DESCRIPTIONS  OF  THE  PROGRAMS  CONTAINED  IN  THE  PACKAGE,  AND  A LIST 
OF  REFERENCES  TO  THE  PACKAGE  DOCUMENTATION. 
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2.  mathematical  and  statistical  programs 


2.1.  AID-H  (AUTOMATIC  INTERACTION  DETECTOR) 


2.1.1.  ABSTRACT 


AID-9  IS  A COMPUTER  PROGRAM  WRITTEN  PRIMARILY  IN  FORTRAN  AND 
ADAPTED  From  an  EARLIER  PROGRAM  <SONQUIST  and  MORGAN,  19AM) 
OBTAINED  FROM  THE  UNIVERSITY  OF  MICHIGAN.  IT  IS  USEFUL  IN  THE 
AUTOMATIC  IDENTIFICATION  OF  INTERACTIONS  AMONG  PREDICTOR  VARIABLES 
FOR  MULTIPLE  LINEAR  REGRESSION  ANALYSIS  AND  IS  ESSENTIALLY  A MODEL 
IDENTIFICATION  PROCESS  SINCE  IT  PROVIDES  AN  OPTIMUM  SOLUTION  FOR 
THE  ANALYSIS.  GIVEN  AN  INITIAL  GROUP  OF  DATA,  THE  DEFINITION  OF 
THE  CRITERION  VARIABLE,  AND  THE  DEFINITION  OF  THE  PREDICTOR 
VARIABLES,  THE  BASIC  IDEA  OF  THE  AID-9  ALGORITHM  IS  TO  EXPLAIN  THE 
VARIANCE  OF  THE  CRITERION  VARIABLE  BY  THE  SEQUENTIAL  SPLITTING  OF 
the  ORIGINAL  GROUP  INTO  SUBGROUPS.  THE  SPLITTING  IS  DONE  IN  SUCH 
A WAV  AS  TO  MINIMIZE  THE  WITHIN  GROUPS  (ERROR)  SUM  OF  SQUARES. 
THIS  IS  ACCOMPLISHED  BY  THE  EXAMINATION  OF  EACH  POSSIBLE  SPLIT  OF 
EVERY  PREDICTOR  VARIABLE  OF  THE  CURRENT  CANDIDATE  GROUP  TO  BE 
SPLIT  AND  BY  THE  SELECTION  OF  THE  SPLIT  GIVING  THE  SMALLEST  WITHIN 
GROUPS  SUM  OF  SQUARES.  THREE  BASIC  STATISTICS  ARE  REPORTED  AT 
EACH  SPLIT;  AN  R-SQUARED  VALUE  INDICATING  THE  PERCENTAGE  OF  THE 
CRITERION  VARIANCE  EXPLAINED  THROUGH  THE  CURRENT  SPLITTING  STATE! 
AN  F-VALUE  INDICATING  THE  SIGNIFICANCE  OF  THE  REDUCTION  IN  THE 
ERROR  SUM  OF  SQUARES  DUE  TO  THE  CURRENT  SPLIT;  AND  AN  F-VALUE  FOR 
A ONE-WAY  ANALYSIS  OF  VARIANCE  CONSIDERING  ALL  GROUPS  AT  THIS 
STAGE.  AlD-9  WILL  HANDLE  UP  TO  300  PREDICTOR  VARIABLES  WHICH  MAY 
BE  EITHER  CATEGORICAL  OR  CONTINUOUS.  ALL  PREDICTORS  ARE  RECODED 
TO  CATEGORICAL  VARIABLES  ACCORDING  TO  USER  SPECIFICATIONS.  THE 
MAXIMUM  NUMBER  OF  RECODE  CATEGORIES  FOR  A PREDICTOR  IS  50.  (SMS) 


2.1.2.  DOCUMENTATION  REFERENCES 


KOPLYAY,  J.B.,  GOTT,  C.D-,  & ELTON,  J.H.  AUTOMATIC 

INTERACTION  DETECTOR-VERSION  9 ( A I D ) - H REFERENCE  MANUAL. 

AFHRL-TR-73- 1 7 , AD-773  803.  LACKLAND  A F B , Tx!  PERSONNEL 
RESEARCH  DIVISION,  AIR  FORCE  HUMAN  RESOURCES  LABORATORY, 
OCTOBER  1973. 
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GOTT,  C.D.,  & KOPLYAY,  J.B.  AUTOMATIC  INTERACTION  DETECTOR 

-VERSION  H (AID)-H  REFERENCE  MANUAL  ADDENDUM  !•  AFHRL-TR 
-77-30  , AD-A0R2968.  BROOKS  AFB,  TXI  COMPUTATIONAL  SCIENCES 
DIVISION,  AIR  FORCE  HUMAN  RESOURCES  LABORATORY,  JULY  1977. 

SONQUIST,  J.A.,  & MORGAN,  J.N.  THE  DETECTION  OF  INTERACTION 
EFFECTS.  SURVEY  RESEARCH  CENTER,  INSTITUTE  FOR  SOCIAL 
RESEARCH,  UNIVERSITY  OF  MICHIGAN,  ANN  ARBOR,  1968. 
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2.2.  ANOVa-UNEQ  (UNEQUAL  N ANALYSIS  OF  VARIANCE) 


2.2.1.  ABSTRACT 


ANOVa-UNEQ  PERFORMS  ONE-  OR  TWO-WAY  ANALYSIS  OF  VARIANCE  FOR 
EXPERIMENTAL  DATA  HAVING  UNEQUAL  (OR  EQUAL)  CELL  FREQUENCIES.  THE 
two-way  analysis  is  carried  out  using  an  approximation  procedure 
which  yields  identical  f-ratios  to  those  produced  in  an  unweighted 
means  analysis.  in  addition  to  the  customary  COMPUTATIONS,  a 
CHI-SQUARE  value  CAN  BE  COMPUTED  FOR  BARTLETT’S  HOMOGENEITY  OF 
VARIANCE  TEST  AND,  IN  THE  ONE-WAY  ANALYSIS  OF  VARIANCE,  AN 
OMEGA-SQUARE  STRENGTH  OF  ASSOCIATION  VALUE  CAN  BE  REPORTED.  A 
DUNCAN  MULTIPLE  RANGE  TEST  FOR  INVESTIGATION  OF  THE  DIFFERENCES 
BETWEEN  ALL  PAIRS  OF  CELL  MEANS  CAN  BE  PERFORMED  AT  THE  OPTION  OF 
THE  USER. 

ANOVa-UNEQ  WILL  NOT  PERFORM  AN  ANALYSIS  FOR  AN  EXPERIMENT 
HAVING  ANY  EMPTY  CELLS  OR  ANY  CELLS  WITH  ONE  OBSERVATION  ONLY. 
THE  MAXIMUM  NUMBER  OF  GROUPS  WHICH  MAY  BE  PRESENT  IN  THE  ANALYSIS 
OF  VARIANCE  IS  20,  WHILE  THE  MAXIMUM  NUMBER  OF  PROBLEMS  WHICH  CAN 
BE  PERFORMED  ON  ONE  SAMPLE  IS  ALSO  20.  (SMS) 


2.2.2.  DOCUMENTATION  REFERENCE 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
ANOVA-UNEQ. 

ANOVA-UNEQ  REPORT  RETRIEVAL  COMMAND:  8T-T.DOC  ANOVA-UNEQ. 
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2.3.  BAYS  (BAYESIAN  METHOD  FOR  CATEGORICAL  PREDICTION) 


2.3.1.  ABSTRACT 


WITH  A SET  OF  CATEGORICAL  PREDICTORS  AND  A CATEGORICAL 
CRITERION  AS  INPUT  VARIABLES,  THE  OBJECTIVE  OF  THE  ANALYSIS 
CARRIED  OUT  BY  THE  BAYS  PROGRAM  IS  TO  FIND  A SUBSET  OF  THE 
PREDICTORS  WHICH  MOST  EFFECTIVELY  DISCRIMINATES  AMONG  THE 
CRITERION  CATEGORIES.  BAYS  IS  BASED  ON  A MODIFIED  VERSION  OF  THE 
"ABCD  TECHNIQUE"  DESCRIBED  BY  MOONAN  (1972)  AND  USES  BAYES* 
FORMULA  TO  COMPUTE  PROBABILITIES  OF  CLASS  MEMBERSHIP  FOR  EACH  CASE 
WITH  THE  RESULT  THAT  AN  INDIVIDUAL  IS  ASSIGNED  TO  THE  CRITERION 
CATEGORY  FOR  WHICH  HIS  A POSTERIORI  PROBABILITY  IS  HIGHEST  OR  HIS 
COST  OF  MISCLASS1F1CATION  IS  LOWEST  (AT  THE  OPTION  OF  THE  USER). 
AN  IMPROVEMENT  TO  THE  ABCD  TECHNIQUE  IS  EMPLOYED  BY  BAYS  IN  WHICH 
A STEPWISE  PROCEDURE  IN  THE  MODEL-BUILDING  ALGORITHM  CAN  CAUSE 
VARIABLES  TO  BE  ELIMINATED  AFTER  THEY  HAVE  BEEN  ADDED  TO  THE 
PREDICTIVE  SCHEME.  AT  EACH  STAGE  OF  THE  MODEL-BUILDING  PROCESS, 
THE  PREDICTIVE  COMPOSITE  IS  FORMED  WHICH  CORRESPONDS  TO  THE 
HIGHEST  CLASSIFICATION  ACCURACY  RESULTING  FROM  ALL  POSSIBLE 
ADDITIONS  (OR  DELETIONS)  OF  ONE  VARIABLE  To  (OR  FROM)  THE 
PREDICTIVE  COMPOSITE  EXISTING  AT  THE  PREVIOUS  STaGE.  BY  REVIEWING 
THE  HIT  TABLE  OUTPUT  PRODUCED  AT  EACH  STAGE,  A USER  CAN  COMPARE 
the  classification  accuracies  of  the  PREDICTIVE  COMPOSITES  as  THEY 
ARE  GENERATED.  ADDITIONALLY,  PREDICTIVE  COMPOSITES  CAN  BE 
EVALUATED  IN  TERMS  OF  M I SC L A S S I F I C A T I 0 N COSTS.  THE  FOLLOWING 
OPTIONS  ARE  AVAILABLE  TO  THE  BAYS  USER : (A)  EMPIRICAL 
PROBABILITIES  may  be  supplied  by  the  user  or  the  program  will 

COMPUTE  THEM  FROM  THE  APPROPRIATE  SAMPLE(S)*.  IB)  THE  ITEMS  MAY  BE 
SELECTED  SO  AS  TO  MAXIMIZE  CORRECT  CLASSIFICATIONS  OR  MINIMIZE 
EXPECTED  COSTS  OF  M I SCLASS I F I C AT  I ONS I (C)  CERTAIN  VARIABLES  MAY  BE 
FORCED  INTO  THE  PREDICTIVE  SCHEME  INITIALLY!  AND  (D)  THE  SET  OF 
PREDICTORS  SELECTED  MAY  BE  CROSS-VALIDATED  ON  UP  TO  TWO  SAMPLES 
WITHIN  THE  SAME  RUN. 

BAYS  IS  PROGRAMMED  TO  HANDLE  UP  TO  200  ITEMS,  FIVE  CRITERION 
CATEGORIES,  AND  THREE  SAMPLES,  WITH  THE  LIMITATION  THAT  EXCESSIVE 
COMPUTER  RUN  TIMES  CAN  BE  INCURRED  FOR  PROBLEMS  INVOLVING  LARGE 
SAMPLES.  (SMS) 
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2.3.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  ( UNPUBL I SHED  I *. 
BAYS. 


BAYS  REPORT  RETRIEVAL  COMMAND;  BT  *T .DOC  BAYS. 
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TITLE  - 
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2.4.  CANCOR  (CANONICAL  CORRELATION) 


2.4.1.  ABSTRACT 


THE  PURPOSE  OF  CANONICAL  ANALYSES  IS  TO  DEFINE  THE  PRIMARY 
INDEPENDENT  DIMENSIONS  (BASIS  VECTORS)  WHICH  RELATE  ONE  SET  OF 
VARIABLES  TO  ANOTHER  SET  OF  VARIABLES.  THE  TECHNIQUE  IS  PRIMARILY 
DESCRIPTIVE.  ALTHOUGH  THE  METHOD  INVOLVES  FINDING  SETS  OF  WEIGHTS 
WHICH  MAXIMIZE  THE  CORRELATIONS  BETWEEN  TWO  COMPOSITE  VARIABLES 
(ONE  FOR  EACH  SET  OF  ORIGINAL  VARIABLES).  THE  TWO  OUTPUTS  SHOULD 
SUGGEST  ANSWERS  TO  QUESTIONS  CONCERNING  THE  NUMBER  OF  WAYS  THE  TWO 
SETS  OF  MEASURES  ARE  RELATED,  THE  STRENGTHS  OF  ThE  RELATIONSHIPS, 
AND  THE  NATURE  OF  THE  RELATIONSHIPS  SO  DEFINED. 

THIS  CANONICAL  CORRELATION  ROUTINE  IS  A MODIFICATION  AND 
COMBINATION  OF  THE  ROUTINES  IN  "MULTIVARIATE  PROCEDURES  FOR  THE 
BEHAVIORAL  SCIENCES"  BY  COOLEY  AND  LOHNES  (1962)  AND  "FORTRAN 
PROGRAMMING  FOR  THE  BEHAVIORAL  SCIENCES"  BY  VELDMAN  (1967).  THE 
PROGRAM  COMPUTES  A FULL  SET  OF  CANONICAL  CORRELATIONS  FOR  TWO  SETS 
OF  VARIABLES.  THE  OUTPUT  CONTAINS  MEANS  AND  STANDARD  DEVIATIONS 
FOR  ALL  VARIABLES',  SET  A CORRELATIONS*.  SET  B CORRELATIONS)  CROSS 
CORRELATIONS  (BETWEEN  THE  A AND  B VARIABLES)?  WILKS’  LAMBDA  AND 
CHI-SQUARE  FOR  THE  FULL  SET  OF  ROOTS.  ADDITIONALLY,  FOR  EACH 
INDIVIDUAL  ROOT,  THE  PROGRAM  REPORTS  THE  CHI-SQUARE  VALUE,  THE 
DEGREES  OF  FREEDOM,  THE  SIGNIFICANCE  OF  THE  CHI-SQUARE,  BOTH  SETS 
OF  WEIGHTS,  AND  (OPTIONAL)  CORRELATIONS  BETWEEN  THE  ORIGINAL  A AND 
B VARIABLES  AND  THE  CANONICAL  VARIATES. 

THE  PROGRAM  IS  RESTRICTED  TO  A MAXIMUM  OF  SO  VARIABLES  PER 
SET.  (SMS) 


2.4.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED);  TITLE  - 
CANCOR. 

CANCOR  REPORT  RETRIEVAL  COMMAND:  8T*T.D0C  CANCOR. 
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2.S.  COMPARE  (COMPARE  ONE-WAY  FREQUENCIES) 


2.S.I.  ABSTRACT 


THIS  PROGRAM  COMPARES  ONE-WAY  FREQUENCY  DISTRIBUTIONS  FROM 
DIFFERENT  INPUT  FILES.  THE  DISTRIBUTIONS  ARE  GENERATED  BY  THE  DIG 
PROGRAM.  (SEE  SECTION  2.8  FOR  A DESCRIPTION  OF  THE  DIG  PROGRAM.) 
COMPARE  IS  ESPECIALLY  APPROPRIATE  FOR  COMPARISON  OF  RESPONSES  TO 
SIMILAR  QUESTIONS  FROM  DIFFERENT  SURVEY  DATA  SETS. 

UP  TO  9 FREQUENCY  DISTRIBUTION  FILES  MAY  BE  INPUT  TO  A SINGLE 
COMPARE  RUN.  FREE  FORMAT  CONTROL  COMMANDS  DIRECT  THE  WAY  IN  WHICH 
THE  PROGRAM  WILL  ORGANIZE  THE  OUTPUT  REPORT. 

ALL  PRINTED  REPORTS  PRODUCED  BY  COMPARE  CONFORM  TO  THE 
AFHRL/SM  STANDARDS  FOR  FINAL  REPORT  PRODUCTS.  ISMA) 


2.5.2.  DOCUMENTATION  REFERENCE 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED)!  TITLE  - 
SMAZ  UTILITY  PROGRAMS  & PROCESSORS. 

REPORT  RETRIEVAL  COMMAND!  8Z*Z.D0C. 
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2.6.  CROSS-CLAS  ( C ROS S -CL  A S S I F l C A T l ON ) 


2.6.1.  ABSTRACT 


THE  CROSS-CLAS  PROGRAM  PRODUCES  C R 0 S S - C L A S S I F I C A T I 0 N S (ALSO 
CALLED  CROSS-TABULATIONS  OR  CONTINGENCY  TABLESl  IN  TWO,  THREE,  OR 
FOUR  DIMENSIONS.  ROW  AND  COLUMN  PERCENTAGES,  CHI-SQUARES,  AND 
KENDALL'S  TAU  MAY  BE  OBTAINED  FOR  ALL  TABLES  IF  DESIRED. 

A MAXIMUM  OF  99  PROBLEMS  MAY  BE  EXECUTED  IN  ONE  RUN,  EACH 

problem  can  produce  up  to  bo  tables  from  a given  set  of  control 

CARDS  AND  data.  A GIVEN  SET  OF  DATA  MAY  CONSIST  OF  A MAXIMUM  OF 
99,999  OBSERVATIONS,  WITH  NO  MORE  THAN  80  VARIABLES  PER 
OBSERVATION.  DATA  MAY  BE  PROCESSED  AS  READ  IN,  OR  THEY  MAY,  IF 
SPECIFIED,  BE  RECODED  ACCORDING  TO  VARIOUS  SCHEMATA.  (SMS! 


2.6.2.  DOCUMENTATION  REFERENCE 

AFHRL  UN  1 v A C 1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
CROSS-CLAS. 

CROSS-CLAS  REPORT  RETRIEVAL  COMMAND:  0T*T.DOC  CROSS-CLAS. 
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2.7.  CURVE-FIT  t FOUR  I ER-PEARSON  CURVE-FITTING) 


2.7.1.  ABSTRACT 


CURVE-FIT  CONSISTS  OF  TWO  ROUTINES  FOR  FITTING  A CONTINUOUS 
DISTRIBUTION  TO  EMPIRICAL  STATISTICAL  DATA  FOR  WHICH  THE  PARENT 
POPULATION  IS  UNKNOWN.  THESE  ROUTINES  WERE  DEVELOPED  BY  JAMES  A. 
CHISMAN  AT  CLEMSON  UNIVERSITT  IN  1948.  ONE  ROUTINE  FITS  A FINITE 
FOURIER  SERIES  TO  THE  SAMPLE  POINTS.  THE  OTHER  ROUTINE  FITS  EACH 
OF  THE  12  PEARSON  DISTRIBUTION  TYPES  (PLUS  THE  NORMAL 
DISTRIBUTION)  TO  A SET  OF  DATA. 

THE  FOURIER  ROUTINE  ACCOMPLISHES  FOURIER  CURVE-FITTING  FOR  UP 
TO  SO  EQUALLY  SPACED  POINTS  AND  WILL  PROVIDE  THE  USER  WITH  THE 

following:  the  chi-square  test  for  goodness  of  fit:  various 

TABLES,  INCLUDING  A TABLE  OF  CUMULATIVE  PROBABILITIES;  AN  OPTIONAL 
graph  of  the  data  points  and  the  fitted  curve;  and  appropriate 
DIAGNOSTIC  MESSAGES. 

THE  PEARSON  ROUTINE  COMPUTES  THE  PARAMETERS  OF  EACH  OF  13 
PEARSON  STATISTICAL  DISTRIBUTION  TYPES  FROM  THE  FIRST  FOUR  MOMENTS 
OF  THE  DATA  AS  DESCRIBED  IN  "FREQUENCY  CURVES  AND  CORRELATION"  BY 
W.P.  ELOERTON  (1953).  A CHI-SQUARE  TEST  IS  PROVIDED  TO  COMPARE  THE 
FIT  OF  THE  VARIOUS  DISTRIBUTION  TYPES  ON  A GIVEN  SET  OF  DATA.  A 
MAXIMUM  OF  50  INPUT  POINTS  IS  ALLOWED  WITH  THE  REQUIREMENT  THAT 

they  be  equally  spaced.  <smsj 


2.7.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
CURVE-F 1 T . 


CURVE-FIT  REPORT  RETRIEVAL  COMMAND:  8T*T.D0C  CURVE-FIT. 
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2.8.  DIG  (FREQUENCY  DISTRIBUTION  GENERATOR) 


2.8.1.  ABSTRACT 


DIG  PROVIDES  A FAST  AND  EFFICIENT  METHOD  OF  DISTRIBUTING 
INFORMATION  ON  A FILE.  DIG  IS  ACTUALLY  A PROGRAM  GENERATOR  WHICH 
WILL  ACCEPT  MINIMAL  CONTROL  INFORMATION  AND  IN  TURN  AUTOMATICALLY 
GENERATE  INTERNAL  CODE  AND  EXECUTE  THE  CREATED  PROGRAM.  NOTABLE 
POINTS*. 

A.  MAY  HANDLE  500-600  COMPOSITE  FIELDS  IN  A SINGLE  PASS. 
MAXIMUM  FIELD  LENGTH  IS  30  FIELDATA  OR  20  ASCII  CHAR- 
ACTERS. 

B.  The  basic  ALGORITHM  is  complex,  using  nested  binary 
trees  for  internal  storage  and  an  asynchronous  merge 

(ORDER  16)  ON  EXTERNAL  STORAGE. 

C.  The  program  is  extremely  reliable  and  its  efficiency 

USUALLY  WILL  EXCEEO  THAT  OF  A SPECIALIZED  PROGRAM  TO 
DO  THE  SAME  JOB. 

D.  9S*  OF  THE  PROGRAMMING  WORK  IS  DONE  BY  THE  PROGRAM. 

E.  The  DIG  FREQUENCY  OUTPUT  FILE  MAY  be  REPORTED  USING 
several  programs  idip,  dimi2,  iway,  etc.). 

F.  CAN  HANDLE  an  INPUT  BLOCKSIZE  UP  to  2089  WORDS. 

G.  21  DISTRIBUTIONS  OF  1,072,328  CASES  (6  REELS)  USING 
Ss  FIELDS  OF  AN  AVERAGE  LENGTH  OF  3 FIELDATA  CHARAC- 
TERS TOOK  3 HOURS  19  MINUTES  OF  WALL  CLOCK  TIME  WITH 
APPROXIMATELY  1 HOUR  18  MINUTES  OF  CPU  TIME. 

( SMO  > 


2.8.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
DISTRIBUTION  GENERATOR,  DIG. 

DIG  REPORT  RETRIEVAL  COMMAND:  BS«S.DOC  DIG. 
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2.9.  0 I M I 2 (MULTI-VARIABLE  FREQUENCY  DISTRIBUTION) 

2.9.1.  ABSTRACT 


D I M 1 2 IS  A LARGE  MODULAR  PROGRAM  WHICH  ENABLES  THE  PRINTING  OF 
MULT  I -VAR UBLE  DISTRIBUTIONS,  INCLUDING  FREQUENCY,  PERCENTAGE, 
MEAN,  STANDARD  DEVIATION,  MEDIAN,  CUMULATIVE  FREQUENCY,  AND 
CUMULATIVE  PERCENTAGE  DISTRIBUTIONS  AS  DESIRED.  THE  MODULARITY  OF 
THE  PROGRAM  ALLOWS  FOR  THE  FOLLOWING: 

A.  USE  OF  LESS  CORE,  LEAVING  MORE  ROOM  FOR  OTHER  RUNS  IN 

the  system; 

B.  The  ABILITY  TO  EASILY  STOP  AND  RESTART  THE  PROGRAM 
WITH  MINIMUM  RECOVERY  EFFORT  IN  A SYSTEM  CRASH*.  AND 

C.  EASY  ADDITION  OF  NEW  MODULES. 

RESULTS  ARE  PRINTED  AS  A SERIES  OF  MATRICES  WITH  PROVISION  FOR 
UP  TO  950  COLUMNS  (10  COLUMNS  PER  PAGE)  ANO  3000  ROWS  (DOUBLE  OR 
SINGLE  SPACED).  EACH  MATRIX  CAN  BE  ASSOCIATED  WITH  ANY 

combination  of  up  to  io  major  variables,  where  each  major  variable 

IS  FIXED  AT  ANY  OF  300  POSSIBLE  VALUES.  AN  ADDITIONAL  FEATURE 
PERMITS  THE  USER  TO  RECODE  VALUES  ON  ANY  ROW  VARIABLE  AND  TO  FORCE 
THE  COLLAPSE  OF  DISTINCT  VALUES  ON  THE  ROW  VARIABLE  INTO  A SINGLE 
VALUE.  ENGLISH  LANGUAGE  DESCRIPTIONS  OF  CODED  VARIABLES  MAY  BE 
USED  TO  INCREASE  THE  READABILITY  OF  THE  TABLES.  THE  USER  MAY 
SPECIFY  COMPUTATION  OF  SUBTOTAL  ROWS  AT  DESIGNATED  POSITIONS  IN 
THE  MATRIX.  THE  DATA  DISPLAYED  FROM  THE  CHI-SQUARE  TEST  OF 
INDEPENDENCE  INCLUDE  THE  CHI-SQUARE  VALUE,  SIGNIFICANCE  LEVEL, 
OEGREES  OF  FREEDOM,  AND  CONTINGENCY  COEFFICIENT.  (SMAI 


2.9.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
FREQUENCY  DISTRIBUTION  PRINT  PROGRAM. 

REPORT  RETRIEVAL  COMMAND:  BZ*Z.DOC. 
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2.10.  DIP  (DISTRIBUTION  PRINT  PROGRAM) 


2.10.1.  ABSTRACT 


DIP  IS  A PRINT  ROUTINE  FOR  THE  OUTPUT  OF  A FREQUENCY 
DISTRIBUTION  GENERATING  PROGRAM  ( E . G . * DIG).  DIP  PACKS  THE 
MAXIMUM  NUMBER  OF  COLUMNS  AND  ROWS  PER  PAGE  BASED  ON  THE  FIELD 
SIZE  AND  AUTOMATICALLY  DISPLAYS  SUBTOTALS  FOR  EACH  LEVEL  OF 
INTERMEDIATE  AND  MAJOR  VARIABLES  WITHIN  MULT  I -VAR  I ATE  DISTRIBUTION 
REPORTS.  A TEST  ON  THE  RESULTS  OF  21  DISTRIBUTIONS  OF  1,072,328 
CASES  AS  PRODUCED  BY  DIG  TOOK  APPROXIMATELY  13  MINUTES  OF  WALL 
CLOCK  TIME  AND  ABOUT  6 MINUTES  OF  CPU  TIME  TO  PRINT  1033  PAGES. 
THE  PRINT  WAS  M TO  5 COLUMNS  PER  PAGE  (ABOUT  200-250  VALUES  PER 
PAGE).  (SMO) 


2.10.2.  DOCUMENTATION  REFERENCES 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
DISTRIBUTION  PRINT  PROGRAM,  DIP. 

DIP  REPORT  RETRIEVAL  COMMANDS  BS*S.DOC  DIP. 
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2.11.  FMPS  IFUNCTIONAL  MATHEMATICAL  PROGRAMMING  SYSTEM) 


2.11.1.  ABSTRACT 


FMPS  IS  A MATHEMATICAL  PROGRAMMING  SYSTEM  SUPPLIED  BY 
SPERRY-UNI VAC  FOR  USE  ON  THE  UNIVAC  1108.  THE  PROGRAM  PROVIDES  ALL 
THE  PROCEDURES  COMMONLY  USED  TO  SOLVE  LINEAR  PROGRAMMING  PROBLEMS 
AND  HAS  THE  CAPABILITY  TO  SOLVE  LARGE  PROBLEMS.  ADDITIONALLY,  THE 
PROGRAM  HAS  THE  FACILITY  TO  PRODUCE  USER-DESIGNED  OPTIMIZATION 
PROCEDURES  THROUGH  THE  USE  OF  A USER-ORIENTED  CONTROL  LANGUAGE 
WHICH  RESEMBLES  THE  FORTRAN  LANGUAGE.  (SMS) 


2.11.2.  DOCUMENTATION  REFERENCES 


UNIVAC  SYSTEMS  REFERENCE  LIBRARY.  SPERRY  UNIVAC  1100  SERIES 

FUNCTIONAL  MATHEMATICAL  PROGRAMMING  SYSTEM  (FMPS)  PROGRAM- 
MER REFERENCE.  UP-8198,  SPERRY  RAND  CORPORATION,  197S. 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
FMPS. 

FMPS  REPORT  RETRIEVAL  COMMAND:  8T*T.D0C  FMPS. 
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2.12.  GEN-H l ER-GRP  (GENERATE  HIER-GRP  INPUT  DATA) 


2.12.1.  ABSTRACT 


GEN-HIER-GRP  IS  A UTILITY  PROGRAM  WHICH  WILL  GENERATE  H I ER-GRP 
(SEE  SECTION  2 • I M FOR  A DESCRIPTION  OF  HIER-GRP)  CONTROL  CARDS  AND 
DATA  USING  THE  INFORMATION  CONTAINED  ON  A TRICOR  (CORRELATION  AND 
REGRESSION  PROGRAM)  REGRESSION  OUTPUT  FILE.  (SMS) 


2.12.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
GEN-HIER-GRP. 

GEN-HIER-GRP  REPORT  RETRIEVAL  COMMAND:  0T*T.DOC  GEN-HIER-GRP. 
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2.13.  GUTTMAN  (GUTTMAN  ITEM  ANALYSIS) 


2.13.1.  ABSTRACT 


THE  GUTTMAN  ITEM  ANALYSIS  PROGRAM  IS  AN  ADAPTATION  OF  THE 
BASIC  PROGRAM  OBTAINED  BY  DR.  MALCOLM  REE  (AFhRL/PE)  FROM  THE 
RESEARCH  OFFICE  OF  THE  SCHOOL  DISTRICT  OF  PHILADELPHIA.  GUT  THAN  • 5 
SCALING  TECHNIQUE  IS  DESIGNED  TO  BE  USED  WITH  MUL T I P LE -CHO I CE 
TESTS  FOR  WHICH  THERE  ARE  NO  PREDETERMINED  CORRECT  RESPONSES,  AND 
THEREFORE,  NO  GUIDELINES  FOR  DIFFERENTIALLY  WEIGHTING  THE  ITEM 
RESPONSE  ALTERNATIVES.  THIS  PROGRAM  USES  A MODIFIED  VERSION  OF 
THE  GUTTMAN  WEIGHTING  PROCEDURE  KNOWN  AS  THE  METHOD  OF  RECIPROCAL 
AVERAGES  (RAVE).  THIS  PROCEDURE,  WHICH  WAS  IMPLEMENTED  BY  BAKER 
AND  RAGSDALE  (196*4)  EMPLOYS  AN  INTERATIVE  APPROACH  IN  WHICH 
PREDETERMINED  (INITIAL)  WEIGHTS  FOR  THE  ITEM  RESPONSE  CATEGORIES 
ARE  USED  AS  A SCALING  KEY  TO  COMPUTE  A TOTAL  SCORE  FOR  EACH 
SUBJECT.  THESE  SCORES  AND  THE  ITEM  RESPONSE  CATEGORIES  ARE  THEN 
USED  TO  DERIVE  A NEW  SET  OF  ITEM  OPTION  WEIGHTS,  WHICH  ARE 
EMPLOYED  To  COMPUTE  TOTAL  SCORES  FOR  EACH  SUBJECT.  THIS  PROCEDURE 
MAY  BE  REPEATED  UNTIL  SUCCESSIVE  SETS  OF  ITEM  OPTION  WEIGHTS  DO 
NOT  DIFFER  APPRECIABLY.  THE  ULTIMATE  SET  OF  WEIGHTS  MAXIMIZES  THE 
INTERNAL  CONSISTENCY  INDEX  OF  TEST  RELIABILITY  FOR  THE  GROUP  OF 
SUBJECTS  ON  THAT  INSTRUMENT.  ITERATIVE  OUTPUT  INCLUDES  THE  NEW 
ITEM  OPTION  WEIGHTS,  MEAN  SQUARES  FOR  ERROR  AND  SUBJECT  VARIANCE, 
HOYT’S  INTERNAL  CONSISTENCY  COEFFICIENT,  OPTIONAL  TABLES  OF  THE 
DIFFERENCE  BETWEEN  SETS  OF  WEIGHTS  ON  SUCCESSIVE  ITERATIONS,  AND 
OPTIONAL  VALIDITY  COEFFICIENTS  BETWEEN  THE  EXTERNAL  (AND/OR 
INTERNAL)  CRITERION  ANO  THE  ITERATION  SCORES.  (SMS) 


2.13.2.  DOCUMENTATION  REFERENCE 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  l UNPUBL I SHED  ) J TITLE  - 
GUTTMAN. 

GUTTMAN  REPORT  RETRIEVAL  COMMAND:  WT*T.DOC  GUTTMAN. 
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2.1*4.  HIER-GRP  (REGRESSION  EQUATION  GROUPING) 


2.14.1.  ABSTRACT 


HIER-GRP  IS  BASED  UPON  THE  MATHEMATICAL  THEORY  IN  THE  AIR 
FORCE  PUBLICATION  W A D D - T N - 6 1 - 3 0 BY  BOTTENBERG  AND  CHRISTAL 
ENTITLED  -ITERATIVE  TECHNIQUE  FOR  CLUSTERING  CRITERIA  WHICH 
RETAINS  OPTIMUM  PREDICTIVE  EFFICIENCY,"  AND  HAS  AS  ITS  OBJECTIVE 
TO  GROUP  OR  CLUSTER  REGRESSION  EQUATIONS.  HIER-GRP  IS  DESIGNED  TO 
REDUCE  A SET  OF  REGRESSION  EQUATIONS  TO  A SINGLE  EQUATION.  IN  THE 
PROCESS,  A TAXONOMY  OF  REGRESSION  EQUATIONS  RESULTS,  BASED  ON  THE 
SIMILARITY  BETWEEN  THE  SYSTEMS.  THE  ITERATIVE  PROCESS  BEGINS  WITH 
A GIVEN  NUMBER  OF  SEPARATE  REGRESSION  EQUATIONS  AND  AT  EACH 
SUCCESSIVE  ITERATION  FORCES  A COMPROMISE  EQUATION  TO  BE 
SUBSTITUTED  FOR  TWO  OF  THE  SEPARATE  SYSTEMS.  AT  EACH  ITERATION  A 
NEW  CLUSTER  OF  REGRESSION  SYSTEMS  IS  FORMED,  OR  AN  EXISTING 
CLUSTER  IS  ENLARGED  SO  THAT  IN  THE  FINAL  STEP,  ALL  EQUATIONS  FORM 
A SINGLE  BROAD  CLUSTER.  THE  CRITERION  FOR  SELECTING  THE  TWO 
EQUATIONS  TO  BE  REPLACED  IS  SPECIFIED  BY  THE  USER  AT  THE  START  OF 

the  program.  the  program  has  six  grouping  criteria  options  from 
which  the  user  may  choose,  but  the  option  normally  selected  is  the 
ONE  WHICH  minimizes  the  overall  loss  in  predictive  efficiency. 

the  hier-grp  algorithm  requires  that  all  the  regression 
equations  be  least-squares  solutions  derived  from  proportional 

PREDICTOR  SUMS  OF  CROSS  PRODUCTS  MATRICES  (I.E.,  THE  MEAN  OF  A 
PREDICTOR  FOR  ONE  EQUATION  MUST  EQUAL  THE  MEAN  OF  THE 
CORRESPONDING  PREDICTOR  IN  ALL  OTHER  EQUATIONS.  IN  ADDITION,  THE 
COVARIANCE  MATRICES  FOR  EACH  EQUATION  MUST  BE  IDENTICAL).  THIS  IS 

the  proportionality  assumption  of  bottenberg  and  christal  in 

WADD-TN-6 1-30. 

HIER-GRP  is  limited  to  a maximum  OF  50  regression  equations 
with  a maximum  of  100  predictor  variables  per  regression  equation. 

( SMS  ) 


2.14.2.  documentation  references 


GOTT,  C.D.  HIER-GRP:  A COMPUTER  PROGRAM  FOR  THE  HIERARCHICAL 
GROUPING  OF  REGRESSION  EQUATIONS.  A F H R L - T R - 7 8 - 1 4 , 

A D - A 0 5 8 4 15.  BROOKS  A F B , Tx:  COMPUTATIONAL  SCIENCES  DIVI- 
SION, AIR  FORCE  HUMAN  RESOURCES  LABORATORY,  JUNE  1978. 
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AFHRL  UNIVaC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED) TITLE 

H 1 er-grp • 

hier-grp  report  retrieval  command:  stm.doc  hier-grp. 
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2.15.  H 1 T A B 8 (HIT-TABLES  FOR  DICHOTOMOUS  CRITERIA) 


2.15.1.  ABSTRACT 


H I T A B 4 IS  A UTILITY  PROGRAM  USED  TO  SUPPLEMENT  THE  TRICOR 
CORRELATION  AND  REGRESSION  PROGRAM  BY  DETERMINING  THE  ACCURACY  OF 
PREDICTION  OF  DICHOTOMOUS  CRITERIA.  THIS  IS  ACCOMPLISHED  BY 
COMPUTING  BINARY  CLASSIFICATION  TABLES  FOR  THE  DICHOTOMOUS  (I/O) 
CRITERIA  USING  SPECIAL  VERSIONS  OF  THE  PERSUB  TABDEV  AND  HITS 
SUBROUTINES  (SEE  SECTION  3.3  FOR  A DESCRIPTION  OF  PERSUB,  TABDEV, 
AND  HITS).  THE  INPUT  FILES  ARE  PREDICTED  SCORE  OUTPUT  FILES  AS 
GENERATED  BY  THE  TRICOR  PROGRAM  FOR  EITHER  THE  VALIDATION  (l.E. 
REGRESSION)  SAMPLE  OR  BOTH  THE  VALIDATION  AND  THE  CROSS-VALIDATION 
SAMPLES.  IF  THE  CROSS-VALIDATION  PREDICTED  SCORES  FILE  IS  ALSO 
USED.  THE  PROGRAM  WILL  APPLY  THE  CUTOFF  SCORE  ESTABLISHED  BY  THE 
VALIDATION  SCORES  TABLE  DEVELOPMENT  TO  THE  CROSS-VALIDATION 
SCORES.  (SMS) 


2.15.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVaC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
H I T A B 8 . 

HITA88  REPORT  RETRIEVAL  COMMAND:  BT*T.DOC  HITABS. 
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2.16.  I A P (ITEM  ANALYSIS  OF  ACHIEVEMENT  TYPE  TESTS) 


2.16.1.  ABSTRACT 


I A P IS  AN  ITEM  ANALYSIS  COMPUTER  PROGRAM  FOR  M UL T I P L E - C H 0 I C E 
ACHIEVEMENT  TYPE  TESTS  WITH  UP  TO  200  ITEMS,  THE  BASIC  CONCEPT  IN 
I A P MAKES  USE  OF  THE  ITEM  CHARACTERISTIC  CURVE  AND  ITS  ASSOCIATED 
PARAMETERS.  XSO  AND  BETA  (TUCKER,  i9M61  BAkER,  196S).  THE 
ANALYSES  PERFORMED  BY  IAP  ENABLE  THE  USER  To  (1)  CHOOSE  ITEMS 
WHICH  HAVE  OPTIMUM  DISCRIMINATION  POWER  AT  A CERTAIN  ABILITY 
LEVEL!  (2)  SCREEN  OFF  A CERTAIN  PERCENTAGE  OF  A GROUP  OF  EXAMINEES 
(3)  ESTIMATE  THE  TRUE  SCORE  OF  AN  INDIVIDUAL!  AND  ( R ) COMPUTE  THE 
PROBABILITY  OF  A CORRECT  RESPONSE.  THE  CRITERION  UPON  WHICH  THE 
PROGRAM  BASES  ALL  THE  STATISTICAL  ANALYSES  MAY  BE  EITHER  THE  TOTAL 
TEST  SCORE,  WHICH  WILL  BE  CORRECTED  FOR  GUESSING  IE  DESIRED,  OR  A 
USER-SPECIFIED  CRITERION.  THE  TEST  MAY  BE  TREATED  EITHER  AS  A 
POWER  TEST,  in  which  CASE  the  ANALYSIS  OF  EACH  item  IS  BASED  ON 
THE  TOTAL  SAMPLE,  OR  AS  A SPEED  TEST,  IN  WHICH  CASE  ONLY  THOSE 
REACHING  A PARTICULAR  ITEM  WILL  BE  CONSIDERED  IN  THE  ANALYSIS  OF 
THAT  ITEM.  IN  ADDITION,  THE  USER  MAY  REQUEST  A FACTOR  ANALYSIS  OF 
THE  TETRACHORIC  INTER-ITEM  CORRELATION  MATRIX.  THE  PRINTED  OUTPUT 
INCLUDES  FOR  EACH  ITEM  ALTERNATIVE  THE  PROPORTION  OF  SUBJECTS 
CHOOSING  THE  ALTERNATIVES!  MEANS!  STANDARD  DEVIATIONS!  BISERIAL 
AND  POINT  BISERIAL  CORRELATIONS  WITH  STATISTICAL  TESTS  OF 

significance;  item  difficulty;  discrimination  index  (betai; 

ABILITY  LEVEL  AT  WHICH  THE  ITEM  DISCRIMINATES  (XSO)  : FREQUENCY 

distribution;  phi  coefficients  (optional);  identification  of  too 

EASY  OR  TOO  DIFFICULT  ITEMS  (AS  SPECIFIED  BY  THE  USER);  AND  PLOTS 
OF  the  Item  CHARACTERISTIC  CURVES  (OPTIONAL).  options  are  also 
available  to  correct  for  item/test  correlation  overlap  and  for 

SCORING  ITEMS  HAVING  MORE  THAN  ONE  ALTERNATIVE  DESIGNATED  AS  THE 
CORRECT  ANSWER.  (SMS) 


2.16.2.  DOCUMENTATION  REFERENCE 


koplyay,  j.b.  REFERENCE  MANUAL  for  iap;  computer  based  item 
ANALYSIS  PROGRAM  for  ANALYZING  ACHIEVEMENT  TYPE  TESTS 
AND/OR  ITEMS.  UNPUBLISHED  MANUSCRIPT,  197M. 
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2.17.  IAP&  (ANALYSIS  OF  QUESTIONNAIRE  TYPE  TESTS! 


2.17.1.  ABSTRACT 


1APG  IS  A VERSATILE  SERIES  OF  ITEM  ANALYSIS  COMPUTER  PROGRAMS 
WHICH  ARE  TO  BE  USED  AS  FIELD  PREDICTION  INSTRUMENTS.  1APG 
PRINTED  RESULTS  ENABLE  THE  USER  TO  CREATE  A COMPOSITE  (SCALE) 
WHICH  OPTIMALLY  PREDICTS  A GIVEN  CRITERION  AND  WHICH  CAN  BE 
QUICKLY  AND  EASILY  SCORED  BY  HAND.  TO  FACILITATE  HAND  SCORING, 
THE  COMPOSITE  SCORE,  I.E..  THE  WEIGHTED  SUM  OF  SCORES  FOR  THE 
QUESTIONNAIRE  ITEMS,  IS  COMPUTED  USING  ONLY  UNIT  (*1  OR  -II 
WEIGHTS  FOR  EACH  ITEM.  THE  INPUT  CONSISTS  OF  RESPONSES  TO  ITEMS, 
SUCH  AS  BIOGRAPHICAL,  INTEREST,  OR  OPINION-TYPE  QUESTIONS.  THE 
DATA  SET  OF  RESPONSES  MUST  BE  DIVIDED  INTO  TWO  OR  THREE 
SUBSAMPLES.  THE  SELECTION  OF  more  THAN  ONE  ALTERNATIVE  OF  ANY 
ITEM  IS  NOT  ALLOWED.  IF  DESIRED,  OMITTED  OR  OUT-OF-RANGE 
RESPONSES  CAN  BE  CONSIDERED  AS  AN  ALTERNATIVE;  OTHERWISE,  ALL 
CASES  WITH  OMITTED  OR  OUT-OF-RANGE  RESPONSES  ARE  ELIMINATED.  IAPG 
IS  DIVIDED  INTO  TWO  LOGICAL  UNITS,  IAPG/J-H  AND  lAPG/5-7. 
IAPG/I-R  COMPUTES  ITEM  KEYS  (A  KEY  FOR  AN  ITEM  W I TH  K ALTERNATIVES 
IS  AN  ORDERED  SET  OF  K NUMBERS  SUCH  THAT  ALL  INDIVIDUALS 
RESPONDING  TO  THE  I-TH  ALTERATIVE  RECEIVE  THE  1-TH  NUMBER  AS 
THEIR  (KEY)  SCORE  FOR  THAT  ITEM)  AND  ITEM  VALIDITIES  ON  EACH 
SUBSAMPLE,  AND  CROSS-VALIDATES  THE  RESULTS  ON  THE  REMAINING 
SUBSAMPLES.  IAPG/B-7  USES  ITEMS  SELECTED  ON  THE  BASIS  OF  IAPG/1-R 
OUTPUT  TO  FORM  THE  BEST  UN  I T - W E 1 G H T E D PREDICTOR  OF  EACH  SPECIFIED 
CRITERION  ON  EACH  SUBSAMPLE  AND  CROSS-VALIDATES  THE  RESULTS  ON  THE 
REMAINING  SUBSAMPLES.  (SMS) 


2.17.2.  DOCUMENTATION  REFERENCE 


ALbERT,  W . G • , & WHITEHEAD,  L.K.  I A PG  t AN  ITEM  ANALYSIS 

PROGRAM  FOR  QUESTIONNAIRE  TYPE  TEST  INSTRUMENTS.  UNPUBLISHED 
MANUSCRIPT,  1975. 
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2.18.  IAPG-HITS  (HIT-TABLE  FOR  DICHOTOMOUS  CRITERIA) 


2.18.1.  ABSTRACT 


IAPG-HITS  IS  A UTILITY  PROGRAM  USED  TO  SUPPLEMENT  THE 
IAPG/S-7  "ITEM  ANALYSIS  PROGRAM  FOR  QUESTIONNAIRE  DATA"  BY 
DETERMINING  THE  ACCURACY  OF  PREDICTION  OF  DICHOTOMOUS  OR 
ARTIFICIALLY  DICHOTOMIZED  CRITERIA.  THIS  IS  ACCOMPLISHED  BY 
COMPUTING  BINARY  CLASSIFICATION  TABLES  FOR  THE  DICHOTOMOUS  (0/1) 
CRITERIA  USING  SPECIAL  VERSIONS  OF  THE  PERSUB  TABDEV  AND  HITS 
SUBROUTINES  (SEE  SECTION  3.3  FOR  A DESCRIPTION  OF  PERSUB.  TABDEV, 
AND  HITS).  THF.  INPUT  FILES  ARE  THE  "1SSP"  (ITEM  SELECTION  SEQUENCE 
FILE)  AND  THE  " K I RF " (KEYED  ITEM  RESPONSE  FILE)  GENERATED  BY  THE 
IAPG/S-7  PROGRAM.  THE  CRITERION  MAY  BE  EITHER  DICHOTOMOUS  OR 
CONTINUOUS.  HOWEVER.  THE  CONTINUOUS  CRITERION  WILL  BE  DICHOTOMIZED 
ACCORDING  TO  A USER  SUPPLIED  CUTOFF  VALUE.  OTHER  OPTIONS  INCLUDE  A 
FILE  CHARACTERISTICS  LIST  AND  A PRINT  OF  THE  TABLE  DEVELOPMENT  FOR 
THE  1 00  CUTOFF  POINTS  BETWEEN  I AND  0. 

the  number  of  cases  is  limited  to  2000  per  sample,  while  the 

NUMBER  OF  ITEMS  INCLUDING  DUMMIES,  MUST  BE  800  OR  LESS. 

( SMS  ) 


2.18.2.  DOCUMENTATION  REFERENCE 


AFHRL  UN1VAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE 
IAPG-HITS. 

IAPG-HITS  REPORT  RETRIEVAL  COMMAND:  6T*T.D0C  IAPG-HITS. 
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2.19.  MANoVA  (MULTIVARIATE  ANALYSIS  OF  VARIANCE) 


2.19.1.  ABSTRACT 


MANOVA  IS  COMPOSED  OF  TWO  PR 0 GR A MS  , " B M D X 6 9 X " AND  " X 6 9 H T , " 
WHICH  TOGETHER  WILL  SOLVE  A VARIETY  OF  MULTIVARIATE  ANALYSIS  OF 
VARIANCE  AND  COVARIANCE  PROBLEMS. 

BMQX69X : 

THIS  PROGRAM,  WHICH  IS  A MODIFICATION  OF  THE  BMD  PROGRAM 
BMDX69,  WILL  PERFORM  MODEL  1 MULTIVARIATE  ANALYSIS  OF  VARIANCE  OF 
VARIANCE  OR  COVARIANCE  FOR  ANY  HIERARCHICAL  DESIGN  WITH  EQUAL  CELL 
SIZES.  THIS  INCLUDES  NESTED,  PARTIALLY  NESTED  AND  CROSSED,  AND 

fully  crossed  designs,  the  design  is  specified  by  indicating  the 

NESTING  RELATIONSHIPS  OF  THE  INDICES.  SEVERAL  ANALYSES  MAY  BE 

PERFORMED  for  each  problem  by  specifying  different  dependent 
variables  or  covariates. 

X 6 9 H t : 

THIS  PROGRAM  WILL  TAKE  THE  COVARIANCE  MATRICES  FILE  GENERATED 
BY  B MD  X fc  9 x AND  COMPUTE  A NEW  MANOVA  ANALYSIS  WITH  AN  ERROR  MATRIX 
GENERATED  FROM  SPECIFIED  ERROR  COMPONENTS.  UNIVARIATE  TESTS  MAY 
BE  PERFORMED  FOR  EACH  DEPENDENT  VARIABLE. 

THE  MANOVA  PACKAGES  ARE  LIMITED  TO  10  ANALYSIS  OF  VARIANCE 
INDICES  (FACTORS)  AND  MU  DEPENDENT  VARIABLES.  CORE  USED  BY  THE 

bmdx69x  package  is  variable  and  is  acquired  as  needed  by  the 

PROGRAM.  (SMS) 


2.19.2.  DOCUMENTATION  REFERENCE 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
MANOVA  . 

MANOVA  REPORT  RETRIEVAL  COMMAND:  6 T * T . D 0 C MANOVA. 
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2.20.  MAX-FACTOR  (FACTOR  ANALYSIS) 


2.20.1.  ABSTRACT 


MAX-FACTOR  IS  A FACTOR  ANALYSIS  PROGRAM  FOR  PROBLEMS  HAVING 
UP  TO  400  VARIABLES.  THE  POWER  METHOD  IS  USED  TO  COMPUTE  THE 
FACTORS  ONE  AT  A TIME  IN  DESCENDING  ORDER  ACCORDING  TO  THE 
PERCENTAGE  OF  THL  TOTAL  VARIANCE  EACH  FACTOR  EXPLAINS.  THIS 
FEATURE  ENABLES  THE  USER  TO  COMPUTE  ONLY  AS  MANY  FACTORS  AS  ARE 
REQUIRED.  ADDITIONAL  FACTORS  MAY  BE  CALCULATED  BY  RESTARTING  THE 
PROGRAM  AT  THE  PRIOR  TERMINATION  POINT.  THE  INPUT  CONSISTS  OF  THE 
UPPER  TRIANGULAR  PORTION  OF  A CORRELATION  MATRIX.  THE  DIAGONAL 
ENTRIES  MAY  BE  ONES,  IN  WHICH  CASE  A PRINCIPAL  COMPONENTS  ANALYSIS 
IS  PERFORMED,  OR  THE  DIAGONAL  ENTRIES  MAY  BE  USER  SPECIFIED 
ESTIMATES  OF  THE  C 0 M M U N A L I T I E S (SUCH  AS  THE  LARGEST  CORRELATION  IN 
EACH  ROW  OF  THE  MATRIX,  OR  THE  SQUARED  MULTIPLE  CORRELATION 
OBTAINED  FROM  THE  REGRESSION  OF  EACH  VARIABLE  ON  THE  REMAINING 
VARIABLES),  OR  THE  DIAGONAL  ENTRIES  MAY  BE  READ  FROM  A FILE.  THE 

user  may  request  that  the  program  be  used  iteratively  to  improve 

INITIAL  ESTIMATES  OF  THE  C 0 M M U N A L I T I E 5 . THE  FACTOR  LOADINGS  CAN 
BE  SAVED  FOR  SUBSEQUENT  USE.  ANY  SUBSET  OF  THE  FACTORS  MAY  BE 
ROTATED  TO  A SIMPLE  STRUCTURE  FORM  USING  THE  VAR1MAX  CRITERION. 
IF  DESIRED,  THE  FACTOR  LOADINGS  CAN  BE  SORTED  ACCORDING  TO 
ABSOLUTE  VALUE.  (SMS) 


2.20.2.  DOCUMENTATION  REFERENCE 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED)!  TITLE  - 
MAX-FACTOR  . 

MAX-FACTOR  REPORT  RETRIEVAL  COMMAND!  ST*T.DOC  MAX-FACTOR. 
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2.2 1.  MEAnSD  (MEAN  & STANDARD  DEVIATION  PROGRAM) 


2.21.1.  ABSTRACT 


THE  PURPOSE  OF  THIS  PROGRAM  IS  TO  ENABLE  THE  USER  TO 

CALCULATE  THE  MEAN  AND  STANDARD  DEVIATION  OF  THE  MINOR  VALUES  FOR 

EACH  CHANGE  IN  THE  MAJOR  VALUE  ON  A DIG  FREQUENCY  FILE. 

ALL  CALCULATIONS  ARE  PERFORMED  IN  DOUBLE  PRECISION  ARITHMETIC 
IN  ORDER  TO  RETAIN  THE  MAXIMUM  ACCURACY  IN  THE  RESULTS. 

EACH  DISTRIBUTION  ON  THE  INPUT  FILE  TO  BE  EXTRACTED  FOR 

COMPUTATIONS  IS  IDENTIFIED  BY  A SEPARATE  CONTROL  CARD.  AN  OUTPUT 
RECORD  FOR  EACH  CHANGE  IN  THE  MAJOR  FIELD  WITHIN  EACH  DISTRIBUTION 
IS  WRITTEN.  EACH  OUTPUT  RECORD  CONTAINS 

A.  THE  DISTRIBUTION  IDENTIFIER, 

B.  MAJOR  FIELD  VALUE, 

C.  MEAN, 

D.  STANDARD  DEVIATION, 

E.  VALID-N  (CASES  WITH  ALL  NUMERIC  DATA), 

F.  TOTAL-N  ( ALL  CASES  I , 

G.  Sum  OF  X,  AND 

H.  Sum  OF  X-SQUARED. 

THE  OUTPUT  FILE  MAY  BE  USED  AS  INPUT  TO  ADDITIONAL 
STATISTICAL  ANALYSES  OR  REPORTED  BY  STANDARD  REPORT  WRITING 
PROGRAMS  SUCH  AS  VIRUS.  (SMA) 


2.21.2.  documentation  reference 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED)’.  TITLE  - 
SMAZ  UTILITY  PROGRAMS  & PROCESSORS. 

REPORT  RETRIEVAL  COMMAND!  BZ*2.D0C. 
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2.22.  MULT-D1SCR  (MULTIPLE  DISCRIMINANT  ANALYSIS) 


2.22.1.  ABSTRACT 


multi-discr  IS  A MAJOR  MODIFICATION  OF  DISCRM  FROM  D.J. 
VELDMAN'S  "FORTRAN  PROGRAMMING  FOR  THE  BEHAVIORAL  SCIENCES",  AND 
WILL  PERFORM  MULTIPLE  DISCRIMINANT  ANALYSIS  FOR  UP  TO  TEN  (AND  IN 
SOME  CASES  MORE)  GROUPS  HAVING  UP  TO  t 00  VARIABLES.  THE  DYNAMIC 
ALLOCATION  OF  CORE  ALLOWS  FOR  MORE  THAN  TEN  GROUPS  AND  100 
VARIABLES.  THE  PROGRAM  HAS  AN  OPTION  TO  COMPUTE  THE  AMOUNT  OF  CORE 
REQUIRED  FOR  A PARTICULAR  PROBLEM  SO  THE  USER  MAY  DETERMINE 
WHETHER  HIS  PROBLEM  MAY  BE  RUN.  A PROBLEM  WITH  h?  VARIABLES  AND  M 
GROUPS,  READING  A COBOL  FILE,  REQUIRED  93K  TO  RUN. 

OUTPUT  FROM  THE  PROGRAM  INCLUDES  A SERIES  OF  LINEAR 
COMBINATIONS  OF  THE  VARIABLES  CALLED  THE  DISCRIMINANT  FUNCTIONS, 
AS  WELL  AS  A TEST  OF  SIGNIFICANCE  OF  EACH  FUNCTION!  GROUP  AND 
TOTAL  SAMPLE  MEANS,  STANDARD  DEVIATIONS,  AND  CORRELATION  MATRICES 
FOR  THE  ORIGINAL  VARIABLES!  CORRELATIONS  BETWEEN  THE  ORIGINAL 
VARIABLES  AND  THE  DISCRIMINANT  FUNCTIONS!  AND  THE  MEANS  OF  THE 
SCORES  WITHIN  EACH  GROUP  (CENTROIDS). 

A NEW  FEATURE  OF  THE  PROGRAM  IS  THE  ABILITY  TO  COMPUTE  THE 
PROBABILITY  THAT  AN  INDIVIDUAL  BELONGS  TO  ONE  OF  THE  GROUPS  GIVEN 
HIS  VARIABLE  SCORES.  FOR  EACH  GROUP,  A TABLE  IS  PRINTED  SHOWING 
HOW  THE  PROBABILITIES  ARE  DISTRIBUTED  WITHIN  THAT  GROUP.  (SMS) 


2.22.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIvAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED)!  TITLE  - 
MULT-DISCR. 


MULT-DISCR  REPORT  RETRIEVAL  COMMAND!  0T*T.DOC  MULT-DISCR. 


27 


MATHEMATICAL  and  statistical  software  index 


2.23.  MULTI ~PR  OB  (MULTIVARIATE  NORMAL  PROBABILITY) 


2.23.1.  ABSTRACT 


PERSONNEL  RESEARCH  IS  FREQUENTLY  CONFRONTED  BY  PROBLEMS  WHICH 
INVOLVE  INVESTIGATION  OF  THE  JOINT  OCCURRENCE  OF  SEVERAL 
ATTRIBUTES  OR  TRAITS  WHICH  ARE  NORMALLY  DISTRIBUTED  IN  A 
POPULATION.  FREQUENTLY,  THE  PROBLEM  IS  ONE  OF  DETERMINING  THE 
PROPORTION  OF  A POPULATION  WHICH  WOULD  BE  EXPECTED  TO  POSSESS 

these  measures:  e.g.,  aptitude  indices  or  the  like,  commonly 

WITHIN  SPECIFIED  RANGES  (CUTOFFS).  THESE  PROPORTIONS  ARE 
REPRESENTED  BY  A SINGLE  NUMBER  WHICH  IS  USUALLY  CALLED  A 
MULTIVARIATE  PROBABILITY. 

THE  PROBLEM  OF  COMPUTING  THESE  PROBABILITIES  WITH  SATISFACTORY 
PRECISION  BEYOND  THE  TWO-VARIATE  CASE  HAS  BEEN,  IN  GENERAL,  AN 
UNSOLVED  ONE.  PREVIOUSLY  PUBLISHED  PAPERS  BY  OWEN  (1956)  AND 
STECK  (1958)  GAVE  METHODS  APPLICABLE  To  THE  TWO-  AND 
THREE-VARIABLE  CASE.  THEIR  METHODS  WERE  MODIFIED  AND  EXTENDED  TO 
FOUR  AND  FIVE  VARIABLES  IN  ORDER  TO  PRODUCE  THE  SYSTEM  OF  DIRECT 
CALCULATION  USED  BY  MULTI-PROB. 

the  MULTI-PROB  program  is  DESIGNED  TO  COMPUTE  multivariate 
NORMAL  PROBABILITIES  FOR  UP  TO  FIVE  JOINTLY  DISTRIBUTED  NORMAL 
VARIABLES  EACH  HAVING  MEAN  0 AND  VARIANCE  OF  1.  GIVEN  THE 
PROPORTIONS  U ( 1 ) ,U ( 2 ),...,  U < N ) , AND  THE  CORRELATION  MATRIX  FOR  A 
SET  OF  RANDOM  VARIABLES  X ( 1 ),  X ( 2 )....,  X ( N ) , EACH  WITH  MEAN  0 AND 
VARIANCE  1 WHOSE  JOINT  DISTRIBUTION  IS  MULTIVARIATE  NORMAL,  THE 
PROGRAM  COMPUTES  THE  PROBABILITY  P ( X ( 1 ) .LE  .U ( 1 ),..., X ( N ). LE  .U ( N ))  . 

the  number  of  variables,  the  correlation  matrix,  and  the  cutoff 
PROPORTIONS  ARE  OBTAINED  BY  CARD  INPUT. 

THE  APPENDIX  OF  THE  MULTI-PROB  DOC  PROCESSOR  REPORT  WILL  BE  OF 
INTEREST  TO  THOSE  WHO  WISH  TO  COMPUTE  MUL T I - V A R I A T E PROBABILITIES 
AS  PART  OF  THEIR  OWN  PROGRAMS.  (SMS) 


2.23.2.  documentation  references 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED)!  TITLE  - 
MULTI -PROS . 

MULTI-PROB  REPORT  RETRIEVAL  COMMAND:  «T*T.DOC  MuLTI-PROB. 
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2.21.  REGREX  (STEPWISE  REGRESSION) 


2.21.1.  ABSTRACT 


REGREX  IS  A COMRUTEK-b ASEO  STEPWISE  REGRESSION  ALGORITHM, 
DEVELOPED  IN-HOUSE,  WHICH  HANDLES  UP  TO  200  VARIABLES.  IT  GREW 
OUT  OF  THE  NEED  OF  RESEARCH  SCIENTISTS  FOR  A MORE  VERSATILE, 
HIGHLY  ACCURATE,  AND  FLEXIBLE  PREDICTION  SYSTEM  TO  SUPPLEMENT  THE 
EXISTING  ITERATIVE  ALGORITHM.  FIRST,  REGREX  WILL  AUTOMATICALLY 
IDENTIFY  REDUNDANT  VARIABLES,  ELIMINATE  THEM  FROM  THE  REGRESSION, 
AND  REPORT  THE  PROPER  DEGREES  OF  FREEDOM  FOR  F-TESTS.  SECOND, 
REGREX  ALLOWS  FOR  EXTENSIVE  RESIDUAL  ANALYSIS  INCLUDING  RESIDUAL 
PLOTS,  TO  EXAMINE  THE  FIT  OF  NORMALITY  AND  HO M 0 S C E D A S T I C I T Y . 
THIRD,  REGREX  PROVIDES  APPROPRIATE  REGRESSION  WEIGHTS  AT  EACH 
STEP.  FOURTH,  THE  PRINTED  OUTPUT  IS  MORE  DETAILED,  ALLOWING  MORE 
PRECISE  AND  PROPER  INTERPRETATION  OF  THE  RESULTS*  FIFTH,  SEVERAL 
OPTIONS  ARE  INCORPORATED  INTO  REGREX  WHICH  ALLOW  GREAT  FLEXIBILITY 
FOR  BOTH  RESEARCH  SCIENTISTS  AND  AIR  FORCE  MANAGERS.  ONE  OF  THESE 
OPTIONS  ALLOWS  NEW  VARIABLES  TO  BE  GENERATED  FROM  COMBINATIONS  OF 
EXISTING  VARIABLES.  ANOTHER  OPTION  ALLOWS  PREDICTORS  TO  BE 
"FORCED"  INTO  THE  PREDICTION  SYSTEM  REGARDLESS  OF  THEIR  RELATIVE 
OR  ABSOLUTE  CONTRIBUTION  IN  EXPLAINING  THt  CRITERION  VARIANCE. 
( SMS  I 


2.21.2.  DOCUMENTATION  REFERENCES 


KOPLYAY,  J.B.,  MATHON,  W.S.,  f»  WHITEHEAD,  L • K • REGREX!  A 

COMPUTER  BASED  STEPWISE  REGRESSION  ALGORITHM  WITH  RESIDUAL 
ANALYSIS.  UNPUBLISHED  MANUSCRIPT,  1976. 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
REGREX  . 

REGREX  REPORT  RETRIEVAL  COMMAND:  WT«T.DOC  REGREX. 
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2.25.  THICOR  (CORRELATION  AND  REGRESSION  PROGRAM) 


2.25.1.  ABSTRACT 


TRICOR  IS  A LARGE  MODULAR  PROGRAM  WHICH  PERFORMS  A VARIETY  OF 
STATISTICAL  ANALYSIS  PROCEDURES  FROM  A STANDARD  COBOL  INPUT  FILE. 
THE  OPERATIONS  TO  BE  PERFORMED  ARE  DEFINED  BY  THE  PROGRAMMER  VIA  A 
SET  OF  FREE-FORMAT  COMMANDS.  TRICOR  WILL  PERFORM  A RIGOROUS 
ANALYSIS  OF  THESE  COMMANDS,  AND  IF  NO  ERRORS  ARE  DETECTED,  WILL 
PERFORM  THf  DESIGNATED  OPERATIONS. 

THE  BASIC  OPERATIONS  PERFORMED  BY  TRICOR  ARE 

A.  CONVERT  RAW  DATA  FROM  COBOL  FILES  TO  FORTRAN  FILES  WITH 
A MAXIMUM  OF  30,000  CASES  IN  UP  TO  5U0  SAMPLES, 

B.  COMPUTE  NEW  VARIABLES  USING  THOSE  WHICH  WERE  CONVERTED 
FROM  THE  INPUT  WITH  A MAXIMUM  OF  MOO  VARIABLES, 

C.  COMPUTE  A CORRELATION  MATRIX  WITH  THE  OPTION  OF  SAVING 
IT  ON  A MASTER  FILE, 

D.  COMPUTE  STEPWISE  AND/OR  ITERATIVE  REGRESSION  ANALYSIS 
WITH  THE  OPTION  OF  SAVING  THE  REGRESSION  WEIGHTS, 

E.  COMPUTE  F-ST AT  I ST  ICS  , AND 

F.  COMPUTE  PREDICTED  SCORES  FOR  C R OSS - V A L I D A T 1 0 N . 

ALL  PRINTED  REPORTS  PRODUCED  BY  TRICOR  CONFORM  TO  THE  AFHRL/SM 
STANDARDS  FOR  FINAL  REPORT  PRODUCTS. 

INTERMEDIATE  WORK  FILES  GENERATED  BY  TRICOR  MAY  BE  SAVED  AND 
USED  WITH  OTHER  UTILITY  PROGRAMS  OR  SPECIAL-PURPOSE  PROGRAMS 
DEVELOPEO  BY  APPLICATIONS  PROGRAMMERS.  ( 5 M A ) 


2.25.2.  DOCUMENTATION  REFERENCE 


AFHRL  UN1VAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
UTILITY  CORRELATION  & REGRESSION  SYSTEM. 

REPORT  RETRIEVAL  COMMAND:  0Z«Z.DOC. 
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2.26.  WPRIME  (RESIDUALS  TEST  FOR  NORMALITY) 


2.26*1.  ABSTRACT 


WPRIME  IS  A COMPUTER  PROGRAM  WHICH  PERFORMS  AN  APPROXIMATE 
ANALYSIS  OF  VARIANCE  TEST  FOR  NORMALITY  ON  THE  RESIDUALS  FROM  A 
REGRESSION  PROBLEM.  THE  MATHEMATICAL  THEORY  UNDERLYING  THE  WPRIME 

test  for  normality  is  documented  in  shapiro  and  francia  119721. 

MONTE  CARLO  RESULTS  HAVE  SHOWN  THAT  FOR  MANY  ALTERNATIVE 
DISTRIBUTIONS  THE  WPRIME  TEST  HAS  POWER  AS  GOOD  OR  BETTER  THAN  THE 
FOLLOWING  test  procedures:  SKEWNESS,  KURToSIS,  chi-square. 
DURBIN,  KOLMOGOROV-SM  1 RNOV  , CRAMER-VAN  MILES,  WElGHTEO  CRAMEK-VAN 
MILES,  AND  DAVID'S  U.  THE  PRINCIPAL  DRAWBACK  TO  THE  USE  OF  WPRIME 
IS  THAT  PERCENTAGE  POINTS  ARE  NOT  AVAILABLE  FOR  SAMPLES  OF  SIZE 
GREATER  THAN  99. 

THE  INPUT  FILE  FOR  WPRJME  IS  THE  OUTPUT  FROM  THE  TRICOR 
PREDICTED  SCORE  LINK.  IF  THE  NUMBER  OF  CASES  ON  THE  OUTPUT  FILE 
EXCEEDS  99,  A PRELIMINARY  RUN  MUST  BE  MADE  TO  SELECT  A RANDOM 
SUBSAMPLE  OF  SIZE  99.  (SMS) 


2.26.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVaC  1 1 0 a DOC  PROCESSOR  REPORT  (UNPUBLISHED)!  TITLE  - 
WPR IME. 


WPRIME  REPORT  RETRIEVAL  COMMAND!  WT*T.DOC  WPRIME 
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2.27.  1 W A Y I ONE-WAY  FREQUENCY  DISTRIBUTION  PRINT) 


2.27.1.  ABSTRACT 


THIS  PROGRAM  PRINTS  ONE-WAY  FREQUENCY  DISTRIBUTIONS  GENERATED 
BY  DIG.  FREE  FORMAT  CONTROL  COMMANDS  DIRECT  THE  WAY  IN  WHICH  THE 
PROGRAM  WILL 

A.  SELECT  THE  DISTRIBUTIONS  TO  BE  PRINTED, 

B.  DESIGNATE  VALUES  TO  BE  "VALID"  OR  "INVALID," 

C.  COMPUTE  MEAN,  STANDARD  DEVIATION,  AND  MEDIAN, 

D.  DESIGNATE  THE  RANGE  OF  VALUES  TO  BE  USED  FOR 
COMPUTAT IONS, 

E.  PROVIDE  REPORT  TITLING  INFORMATION,  AND 

F.  NAME  A UTILITY  FILE  LAYOUT  FOR  DISTRIBUTION  HEADINGS. 

1 W A Y REPORTS  ARE  ESPECIALLY  SUITED  FOR  THE  DISPLAY  OF  SURVEY 
DATA. 


THE  PRINTED  REPORTS  PRODUCED  BY  1WAY  CONFORM  TO  THE  AFHRL/SM 
STANDARDS  FOR  FINAL  REPORT  PRODUCTS.  ( S M A ) 


2.27.2.  DOCUMENTATION  REFERENCE 


AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
SMAZ  UTILITY  PROGRAMS  & PROCESSORS. 

REPORT  RETRIEVAL  COMMAND:  8Z*»Z.00C. 
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3.  mathematical  and  statistical  subroutine  systems 


a subroutine  system  consists  of  a collection  of  COMPUTER 
SUBPROGRAMS.  each  of  which  carries  out  specific  functions  and 
adheres  to  a common  set  of  conventions  to  insure  CONSISTENCY  in 

THE  PROGRAMMING  AND  IN  THE  DOCUMENTATION  FOR  THE  SYSTEM.  BECAUSE 
COMPLETE  COMPUTER  PROGRAMS  AS  SUCH  ARE  NOT  PART  OF  A SYSTEM  OF 
THIS  TYPE.  A DRIVER  PROGRAM  MUST  BE  PREPARED  WHICH  CALLS  INTO 
OPERATION  THE  PARTICULAR  SERIES  OF  THESE  SUBROUTINES  NECESSARY  FOR 
A GIVEN  ANALYSIS.  ONE  OF  THE  MAJOR  ADVANTAGES  ASSOCIATED  WITH  THE 
USE  OF  SUBROUTINE  SYSTEMS  IS  THE  GREATER  FLEXIBILITY  THEY  AFFORD 
TO  THE  DESIGN  OF  COMPUTER  PROGRAMS,  PERMITTING  THE  DEVELOPMENT, 
WITH  RELATIVE  EASE,  OF  COMPUTER  PROGRAMS  WHICH  ARE  CUSTOMIZED  TO 
THE  SPECIFIC  NEEDS  OF  A USER.  ADDITIONALLY,  THE  USE  OF  SUBROUTINE 
SYSTEMS  CAN  SPEED  UP  THE  PREPARATION  OF  NEEDED  COMPUTER  PROGRAMS, 
AS  WELL  AS  PROVIDE  THOROUGHLY  DEBUGGED  AND  RELIABLE  PROGRAMMING 
FOR  THE  REQUIRED  ANALYSIS. 

THERE  ARE  THREE  MATHEMATICAL  AND  STATISTICAL  SUBROUTINE 
SYSTEMS  WITHIN  THE  AFHRL  SOFTWARE  LIBRARY!  THE  UNIVAC  MATH-PACK, 
THE  UNIVAC  STAT-PACK,  AND  THE  PERSUB  SYSTEM.  THE  REMAINING 
SECTIONS  OF  THIS  CHAPTER  DEAL  WITH  EACH  SUBROUTINE  SYSTEM 
SEPARATELY.  GIVING  A GENERAL  DESCRIPTION  OF  EACH  SYSTEM  ALONG  WITH 
A LIST  OF  ITS  SUBROUTINES  AND  THE  SOURCES  OF  ITS  DOCUMENTATION. 
DUE  TO  THE  LARGE  NUMBER  OF  SUBROUTINES  BELONGING  TO  EACH  SYSTEM, 
DESCRIPTIONS  OF  THE  INDIVIDUAL  SUBROUTINES  ARE  NOT  PROVIDED  IN 
THIS  REPORT.  IN  MOST  CASES,  THE  FUNCTION  OF  EACH  SUBROUTINE  IS 
GIVEN  BY  ITS  NAME.  A READER  DESIRING  MORE  INFORMATION  ABOUT  A 
SPECIFIC  SUBROUTINE  SHOULD  CONSULT  THE  SUBROUTINE  SYSTEM'S 
DOCUMENTAT I ON . 
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3.1.  UNIVAC  MATH-PACK 


3.1.1.  GENERAL  DESCRIPTION 


THE  UNIVAC  MATH-PACK  SYSTEM  CONTAINS  A COMPREHENSIVE  SET  OF 
70  MATHEMATICAL  SUBROUTINES  DESIGNED  FOR  UNJVAC  LARGE  SCALE 
SYSTEMS.  THE  SYSTEM  PROVIDES  THE  MORE  FREQUENTLY  USED  TECHNIQUES 
IN  NUMERICAL  ANALYSIS,  WITH  EACH  SUBROUTINE  DESIGNED  SO  THAT  THE 
CAPABILITIFS  OF  THE  UNIVAC  LARGE  SCALE  EQUIPMENT  ARE  USED  MOST 
EFFICIENTLY  ( E . G . , WITH  RESPECT  TO  STORAGE  REQUIREMENTS,  INTERNAL 
COMPUTER  SPEED,  AND  ACCURACY),  AND  THE  PROGRAM  PREPARATION 
REQUIRED  PRIOR  TO  CALLING  THE  SUBROUTINE  IS  MINIMIZED.  THE 
MATH-PACK  SUBROUTINES  ARE  CATEGORIZED  INTO  13  GROUPS  AND  ARE 
LISTED  IN  THE  NEXT  SECTION. 


3.1.2.  LIST  OF  SUBROUTINES 


I NTERPOLAT I ON 


GN1NT  - GREGORY-NEWTON  INTERPOLATION 

GNEXT  - GREGORY-NEWTON  EXTRAPOLATION 

GNPOL  - GREGORY-NEWTON  POLYNOMIAL  EVALUATION 

BE  S I N r - BESSEL  INTERPOLATION 

STINT  - STIRLING  INTERPOLATION 

CDINT  - GAUSS  CENTRAL-DIFFERENCE  INTERPOLATION 
AITINT  - AITKEN  INTERPOLATION 
YLGINT  - LAGRANGE  INTERPOLATION 
SPLN1,  SPLN2  - SPLINE  INTERPOLATION 


NUMERICAL  INTEGRATION 


TRAPNI  - TRAPEZOIDAL  RULE 

SIM1N1  - SIMPSON  1/3  RULE 

SIM3NI  - SIMPSON  3/8  RULE 

STEPNI  - VARIABLE  STEP  INTEGRATION 

GENNI  - GENERALIZED  NUMERICAL  QUADRATURE 

DOUBNI  - DOUBLE  INTEGRATION 

LGAUSS  - GAUSS  QUADRATURE  ABSCISSAS  AND  WEIGHTS 
SIMPTS  - SIMPSON  1/3  RULE  ABSCISSAS  AND  WEIGHTS 
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solution  OF  equations 


NEWTIT  - NEWTON-RAPHSON  ITERATION 
WEGIT  - WEGSTEIN  ITERATION 
A I T 1 T - A I TKEN  I TERAT I ON 

ROOT  CP  - REAL  AND  COMPLEX  ROOTS  OF  A REAL  OR  COMPLEX 
POLTNOM 1 AL 


D IFFERENT I AT  I ON 


DERIV1  - FIRST  DERIVATIVE  APPROXIMATION 
DERJV2  - SECOND  DERIVATIVE  APPROXIMATION 
NTHDER  - NTH  DERIVATIVE  OF  A POLYNOMIAL 


POLYNOMIAL  MANIPULATION 


GIVZRS  - POLYNOMIAL  COEFFICIENTS  GIVEN  ITS  ZEROS 
CVALUE  - COMPLEX  POLYNOMIAL  t VALUATION 
POLYX  - REAL  POLYNOMIAL  MULT  1 PL  1 CAT  1 ON 
C POL  Y X - COMPLEX  POLYNOMIAL  MULTIPLICATION 


matrix  manipulation:  real  matrices 


MXADD  - MATRIX  ADDITION 

MXSUB  - MATRIX  SUBTRACTION 

MXTRN  - MATRIX  TRANSPOSITION 

MXSCA  - MATRIX  MULTIPLICATION  BY  A SCALAR 

MXMLT  - MATRIX  MULTIPLICATION 

MXMDIG  - MATRIX  MULTIPLICATION  BY  DIAGONAL  MATRIX  STORED 
AS  A VECTOR 

g jr  - determinant;  inverse;  solution  of  simultaneous 

EQUAT I ONS 

MXHOI  - INVERSE  ACCURACY  IMPROVEMENT 


matrix  manipulation:  complex  matrices 


CMXADO  - MATRIX  ADDITION 

CMXSU0  - MATRIX  SUBTRACTION 

CMXTRN  - MATRIX  TRANSPOSITION 

CMXSCA  - MATRIX  MULTIPLICATION  BY  A SCALAR 


f 
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CMXMLT  - MATRIX  MULTIPLICATION 

CGJR  - COMPLEX  DETERMINANT;  INVERSE!  SOLUTION  OF 
SIMULTANEOUS  EQUATIONS 


matrix  manipulation:  eigenvalues  and  eigenvectors 


TRIDMX  - TR I D l AGONAL l Z AT  1 ON  OF  REAL  SYMMETRIC  MATRIX 
EIGVAL  - EIGENVALUES  OF  TR I D I AGONAL  MATRIX  BY  STURM 
SEQUENCES 

EIGVEC  - EIGENVECTORS  OF  TRIDIAGONAL  MATRIX  BY  WILKINSON'S 
ME  THOO 


matrix  manipulation:  miscellaneous 


DGJK  - DOUBLE-PRECISION  DETERMINANT!  INVERSE  * SOLUTION  OF 
SIMULTANEOUS  EQUATIONS 

PMXTRI  - POLYNOMIAL  MATRIX  T R I A N G U L A R I Z A T I 0 N 
SCALE  - POLYNOMIAL  MATRIX  SCALING 
MXROT  - MATRIX  ROTATION 


ORDINARY  DIFFERENTIAL  EQUATIONS 


EULDE  - EULER'S  METHOD 
HAMDE  - HAMMING'S  METHOD 

INVAL  - INITIAL  VALUES  FOR  DIFFERENTIAL  EQUATION  SOLUTION 
RKDE  - RUNGE-KUTTA  METHOD 
SODE  - SECOND-ORDER  EQUATIONS 

MRKDE  - REDUCTION  OF  MTH-ORDER  SYSTEM  TO  SYSTEM  OF  M 
FIRST-ORDER  EQUATIONS 


SYSTEMS  OF  EQUATIONS 


JACMX  - JACOBI  ITERATION  TO  DETERMINE  EIGENVALUES  AND 
EIGENVECTORS  OF  SYMMETRIC  MATRIX 
HJACMX  - JACOBI  ITERATIONS  TO  DETERMINE  EIGENVALUES  AND 
EIGENVECTORS  OF  HERMITIAN  MATRIX 
LSIMEQ  - SOLUTION  TO  A SET  OF  LINEAR  SIMULTANEOUS 
EQUATIONS 

NSIMEO  - FUNCTIONAL  ITERATION  TO  DETERMINE  SOLUTION  TO 
SET  OF  NON-LINEAR  EQUATIONS 
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CURVE  FITTING 


CFSRIE  - COEFFICIENTS  OF  FOURIER  SERIES  ON  A CONTINUOUS 
RANGE 

FTRANS  - FOURIER  TRANSFORM 

DFSRIE  - COEFFICIENTS  OF  FOURIER  SERIES  ON  DISCRETE  RANGE 
F1TD  - FITTED  VALUE  AND  DERIVATIVE  VALUES  FOR  A LEAST- 
SQUARES  POLYNOMIAL 

ORTHLS  - ORTHOGONAL  POLYNOMIAL  L E A S T - S Q U A R E S CURVE  FITTING 
FITY  - FITTED  VALUES  FOR  A LEAST-SQUARES  POLYNOMIAL 
COEFS  - COEFFICIENTS  OF  A LEAST-SQUARES  POLYNOMIAL 


PSEUDO  RANDOM  NUMBER  GENERATORS 


NRAND  - INTERVAL  (0,2**27)  GENERATOR 
RANDU  - UNIFORM  DISTRIBUTION 
RANDN  - NORMAL  DISTRIBUTION 
RANDEX  - EXPONENTIAL  DISTRIBUTION 


3.1.3.  DOCUMENTATION  REFERENCES 


UNIVAC  SYSTEMS  REFERENCE  LIBRARY.  UNIVAC  LARGE  SCALE  SYSTEMS 
MATH-PACK  PROGRAM  ABSTRACTS.  UP-M051  REV.  2,  SPERRY  RAND 
CORPORATION,  1970. 

UNIVAC  SYSTEMS  REFERENCE  LIBRARY.  UNIVAC  LARGE  SCALE  SYSTEMS 
MATH-PACK  PROGRAMMERS  REFERENCE.  UP-7SR2  REV.  1,  SPERRY 
RAND  CORPORATION,  1970. 


THE  FIRST  REFERENCE  CONTAINS  A SET  OF  ABSTRACTS  SUMMARIZING 
THE  PURPOSE,  CALLING  SEQUENCE,  AND  METHOD  OF  COMPUTATION  FOR  EACH 
MATH-PACK  SUBROUTINE.  THE  SECOND  REFERENCE  GIVES  MORE  DETAILED 
INFORMATION  CONCERNING  EACH  SUBROUTINE. 
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3.2.  UNIVAC  STAT-PACK 


3.2.1.  GENERAL  DESCRIPTION 


THE  UNIVAC  STAT-PACK  SYSTEM  IS  COMPRISED  OF  70  FUNDAMENTAL 
STATISTICAL  SUBROUTINES  INVOLVING  THE  MORE  FREQUENTLY  USED 
STATISTICAL  TECHNIQUES.  LIKE  THE  SUBROUTINES  IN  THE  UNIVAC 
MATH-PACK  system,  the  stat-pack  subroutines  are  designed  to 

MINIMIZE  THE  PROGRAM  PREPARATION  REQUIRED  PRIOR  TO  CALLING  A 
SUBPROGRAM  AND  TO  OPTIMIZE  ON  A COMBINATION  OF  DESIRABLE  FEATURES 
INCLUDING  INTERNAL  COMPUTER  SPEED,  ACCURACY,  AND  STORAGE 
REQUIREMENTS.  THE  STAT-PACK  SUBROUTINES  ARE  GROUPED  INTO  11 
GENERAL  STATISTICAL  CATEGORIES  AND  ARE  LISTED  IN  THE  FOLLOWING 
SECT  ION. 


3.2.2.  LIST  OF  SUBROUTINES 


DESCRIPTIVE  STATISTICS 


FREQP  - FREQUENCY  POLYGON 
HIST  - HISTOGRAM 

MHIST  - MULTIVARIATE  HISTOGRAM 
GROUP  - GROUPING  OF  DATA 


ELEMENTARY  POPULATION  STATISTICS 


AMEAN  - ARITHMETIC  MEAN 

GMEAN  - GEOMETRIC  MEAN 

HMEAN  - HARMONIC  MEAN 

MEDIAN  - MEDIAN 

MODE  - MODE 

QUANT  - QUANTILES 

OGIVE  - DISTRIBUTION  CURVE 

IQRNG  - INTERPERCENTILE  RANGE 

RANGE  - RANGE 

MNDEV  - MEAN  DEVIATION 

STDEV  - STANDARD  DEVIATION 

C V A R - COEFFICIENT  OF  VARIATION 

ORDER  - ORDER  AND  RANK  STATISTICS 
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CMONT  - CENTRAL  MOMENTS 
AMONT  - ABSOLUTE  MOMENTS 
CUMLT  - CUMULANTS 

SHPCOR  - SHEPPARD  • S CORRECTIONS 
KURSK  - SKEWNESS  AND  KURTOSIS 


DISTRIBUTION,  FITTING  AND  PLOTTING 


BINOM  - BINOMIAL  DISTRIBUTION 
POISON  - POISSON  DISTRIBUTION 
HYPER  - HYPERGEOMETRIC  DISTRIBUTION 
PNORM  - NORMAL  DISTRIBUTION 
AFSER  - ARNE  FISHER  SERIES 


CHI-SQUARE  TESTS 


CHI21S  - CHI-SQUARE  TEST  OF  SAMPLE  PROPORTION  FOR  ONE 
SAMPLE 

CHI  2 J5  - CHI-SQUARE  TEST  OF  SAMPLE  PROPORTION  FOR  J 
SAMPLES 

CHI2P  - CHI-SQUARE  TEST  OF  FIT  TO  POISSON  DISTRIBUTION 

CHI2N  - CHI-SQUARE  TEST  OF  NORMALITY 

CHISAM  - CHI-SQUARE  TEST  OF  HOMOGENEITY 

CHICNT  - CHI-SQUARE  TEST  FOR  INDEPENDENCE 

GENGOF  - CHI-SQUARE  TEST  OF  GENERAL  GOODNESS  OF  FIT 


SIGNIFICANCE  TESTS 


SIGPRP  - TEST  OF  SIGNIFICANCE  OF  PROPORTION  OF  SUCCESSES 
SIGMN  - TEST  OF  SIGNIFICANCE  OF  A MEAN 

SIGOMn  - TEST  OF  SIGNIFICANCE  OF  THE  DIFFERENCE  BETWEEN 
TWO  MEANS 

SIGDVR  - TEST  OF  SIGNIFICNCE  OF  THE  RATIO  BETWEEN  TWO 
VARIANCES 


CONFIDENCE  INTERVALS 


CFDMKv  - CONFIDENCE  INTERVAL  FOR  THE  MEAN;  KNOWN  VARIANCE 
CFDMUV  - CONFIDENCE  INTERVAL  FOR  THE  MEAN:  UNKNOWN 
VAR  I ANCE 

CFDMSU  - CONFIDENCE  INTERVAL  FOR  THE  DIFFERENCE  BETWEEN 
TWO  MEANS 
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CFDVAR  - CONFIDENCE  INTERVAL  FOR  VARIANCE 
TOLJNT  - TOLERANCE  INTERVALS 


ANALYSIS  OF  VARIANCE 


ANOV1  - ONE-WAY  CROSS  CLASSIFICATION 

ANOV2  - TWO-WAY  CROSS  CLASSIFICATION 

ANOV3  - THREE-WAY  CROSS  CLASSIFICATION 

MISDAT  - MISSING  DATA 

VTRANS  - VARIABLE  TRANSFORMATIONS 

ANOVR0  - RANDOMIZED  BLOCKS 

ANOVLS  - LATIN  SQUARES 

ANOVSP  - SPLIT-PLOT  DESIGN 

AN05SP  - SPLIT-SPLIT  PLOT  DESIGN 

ANOVN2  - TWO-WAY  NESTED  DESIGN 

ANO V N 3 - THREE-WAY  NESTED  DESIGN 

ANOCO  - ANALYSIS  OF  COVARIANCE 

GLH  - GENERAL  LINEAR  HYPOTHESES 


REGRESSION  ANALYSIS 


RESTEM  - STEPWISE  MULTIPLE  REGRESSION 
REBSOm  - BACK  SOLUTION  MULTIPLE  REGRESSION 
CORAN  - CORRELATION  ANALYSIS 


TIME  SERIES  ANALYSIS 


MOVAVG  - MOVING  AVERAGES 

SE  ASH ] - SHI SK IN’S  SEASONALITY  FACTORS 

WEMAV  - WEIGHTED  MOVING  AVERAGES 

TRELS  - TREND  ANALYSIS  BY  LEAST  SQUARES 

VADIMe  - VARIATE  DIFFERENCE  METHOD 

TSFARG  - AUTOREGRESSIVE  MODEL 

GEXSMO  - GENERALIZED  EXPONENTIAL  SMOOTHING 

AUXCOR  - AUTO-CORRELATION  AND  C R 0 S S - C 0 R R E L A T I 0 N ANALYSIS 
POWDEN  - POWER  DENSITY  FUNCTIONS 
RCPROr  - RESIDUAL  PROBABILITIES 


MULTI-VARIaTE  ANALYSIS 


GENVAR  - GENERALIZED  VARIANCE 
DISHOT  - HOTELLING'S  DISTRIBUTION 
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DSQ  - MAHAL  ANOB I S • DISTRIBUTION 
SIGIMn  - SIGNIFICANCE  OF  A SET  OF  MEANS 
DISCRA  - DISCRIMINANT  ANALYSIS 

EACTAN  - FACTOR  AND  PRINCIPAL  COMPONENT  ANALYSIS 


MISCELLANEOUS  SUBROUTINES 


PLOTl  - PLOT  OF  ONE  LINE 
JIM  - MATRIX  INVERSION 

MXTMLT  - LEFT  MULTIPLICATION  OF  A MATRIX  BY  ITS  TRANSPOSE 


3.2.3.  DOCUMENTATION  REFERENCES 


UNIVAC  SYSTEMS  REFERENCE  LIBRARY.  UNIVAC  1104  SYSTEM/1108 
MULT  I -PROCESSOR  SYSTEM  STAT-PACK  PROGRAM  ABSTRACTS. 

UP  - 4 0 <♦  1 REV.  2,  SPERRY  RAND  CORPORATION,  19<>9. 

UNIVAC  SYSTEMS  REFERENCE  LIBRARY.  UNIVAC  LARGE  SCALE  SYSTEMS 
STAT-PACK  PROGRAMMERS  REFERENCE.  UP-7502  REV.  I,  SPERRY 
RAND  CORPORATION,  1970. 


THE  FIRST  REFERENCE  CONTAINS  A SET  OF  ABSTRACTS  SUMMARIZING 
THE  PURPOSE,  CALLING  SEQUENCE,  AND  METHOD  OF  COMPUTATION  FOR  THE 
STAT-PACK  SUBROUTINES.  THE  SECOND  REFERENCE  GIVES  MORE  DETAILED 
INFORMATION  CONCERNING  EACH  SUBROUTINE. 
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3.3.  PERSUB  SYSTEM 


3.3.1.  GENERAL  DESCRIPTION 


THE  PERSUB  SUBROUTINE  SYSTEM  IS  A SET  OF  MATRIX-ORIENTED 
SUBROUTINES  DEVELOPED  DURING  THE  MID-SIXTIES  AT  THE  PERSONNEL 
research  laboratory  <now  a part  of  the  air  force  human  resources 
LABORATORY),  AEROSPACE  MEDICAL  DIVISION,  LACKLAND  AIR  FORCE  BASE, 
TEXAS.  THE  SYSTEM  EVOLVED  FROM  WORK  IN  A PARTICULAR  TYPE  OF  DATA 
ANALYSIS.  MANY  OF  THE  SUBROUTINES  IN  PERSUB  INCORPORATE  CLASSICAL 
STATISTICAL  PROCESSES  WHICH  ARE  USED  IN  PERSONNEL  RESEARCH  (E.G., 
CORRELATION,  REGRESSION,  FACTOR  ANALYSIS,  MATRIX  MANIPULATION, 
ETC.),  AND  THE  SYSTEM  WAS  DEVELOPED  PRIMARILY  FOR  THE  PURPOSE  OF 
PROVIDING  THE  RESEARCHER  WITH  A MAXIMUM  OF  FLEXIBILITY  IN 
DESIGNING  A SEQUENCE  OF  ANALYSES  TO  BE  CARRIED  OUT  ON  RESEARCH 
DATA.  THE  ORIGINAL  PERSUB  SYSTEM,  ALONG  WITH  OTHER  SUBROUTINES 
WHICH  WERE  DEVELOPED  LATER,  CONSISTS  OF  6 H SUBROUTINES.  HOWEVER. 
ONLY  36  SUBROUTINES,  THOSE  WHICH  HAVE  A MATHEMATICAL  OR 
STATISTICAL  FUNCTION,  WILL  BE  LISTED  BELOW.  FOR  CONVENIENCE,  THE 
SUBROUTINES  HAVE  BEEN  CATEGORIZED  INTO  SEVEN  GROUPS  ACCORDING  TO 
THE  GENERAL  MATHEMATICAL  OR  STATISTICAL  TECHNIQUE  TO  WHICH  THEY 
RELATE. 


3.3.2.  LIST  OF  SUBROUTINES 


DESCRIPTION  AND  TABULATION 


GRAPH  - TWO-DIMENSIONAL  GRAPH 


MATRIX  ALGEBRA/MANIPULATION 


DETERM  - DETERMINANT  OF  A SYMMETRIC  POSITIVE  DEFINITE 
MATRIX 

EIGEN  - E I GENROOT  AND  EIGENVECTOR  COMPUTATION  FOR  A SYM 
METRIC  MATRIX 

INVERS  - ITERATIVE  MATRIX  INVERSE 

MATADO  - MATRIX  ADDITION 

MATMUL  - MATRIX  MULTIPLICATION 

MATPOW  - MATRIX  POWERING 
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MATTRa  - MATRIX  TRANSPOSE 

PTRANS  - MATRIX  TRANSFORMATION  TO  PRODUCE  AN  ORTHOGONAL 
BASIS  I P-TR ANSFORMAT I ON  I 
WEIGHT  - WEIGHT  ELEMENTS  OF  A MATRIX 

XTRANx  - PREMULTIPLICATION  OF  A MATRIX  BY  ITS  TRANSPOSE 


SIGNIFICANCE  TESTS/PROBABILITT/DISTRIBUTIONS 


B l TRUN 

FSTAT  - 
MUTRUN 

PLEVEl 
RANDCS 
RANEXP 
RBT AFT 

RGAMMa 

SECANT 


- MAXIMUM  LIKELIHOOD  ESTIMATION  FROM  A TRUNCATED 
SAMPLE  OF  THE  MEAN  AND  STANDARD  DEVIATION  OF  A 
BIVARIATE  NORMAL  POPULATION 

F VALUE  ASSOCIATED  WITH  GIVEN  OEGREES  OF  FREEDOM 
AND  PROBABILITY 

- MAXIMUM  LIKELIHOOD  ESTIMATION  FROM  A TRUNCATED 
SAMPLE  OF  THE  MEAN  AND  STANDARD  DEVIATION  OF  A 
TRIVARIATE,  TETRAVARtATE  OR  PENTAVARIATE  NORMAL 
POPUL AT  1 ON 

- PROBABILITY  LEVEL  ASSOCIATED  WITH  GIVEN  F STATIS- 
TIC AND  OEGREES  OF  FREEDOM 

- RANDOM  DEVIATE  GENERATION! 


T I ON 

RANDOM  DEVIATE  GENERATION! 
T I ON 

RANDOM  DEVIATE  GENERATION! 
BUT  I ON 

RANDOM  DEVIATE  GENERATION! 


CHI-SQUARE  DISTRIBU- 


EXPONENTIAL  DISTRIBU- 


BETA,  F OR  T distri- 
GAMMA  DISTRIBUTION 


MAXIMUM  LIKELIHOOD  ESTIMATION  FROM  A TRUNCATED 
SAMPLE  OF  THE  MEAN  AND  STANDARD  DEVIATION  OF  A 
UNIVARIATE  NORMAL  POPULATION 


VARIANCE  ANALY5IS 


HOMVAR  - BOX-SCHEFFE  HOMOGENEITY  OF  VARIANCE  TEST 


CORRELATION  AND  REGRESSION  ANALYSIS 


AKRACY  - PREDICTED  SCORE  ACCURACY  DETERMINATION 
COMPAR  - PREDICTED  SCORE  ANO  ACTUAL  SCORE  COMPARISON 
CORRLB  - CORRELATION  MATRIX 

CORRLD  - CORRELATION  MATRIX  (DOUBLE  PRECISIONI 
HITS  - 2 X 2 HIT  TABLE  (USEFUL  IN  EVALUATING  A REGRESSION 
MODEL  WITH  A DICHOTOMOUS  CRITERION) 

PRE  DSC  - PREDICTED  SCORES  COMPUTATION 
REGREO  - ITERATIVE  REGRESSION 
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HEGREF  - ITERATIVE  REGRESSION  (USEFUL  WHEN  HIGHLY  CORRE- 
LATED VARIABLES  ARE  PRESENT) 

TABDEV  - HIT  TABLE  DEVELOPMENT  (USEFUL  FOR  DEVELOPING  THE 
PREDICTION  SYSTEM  FOR  A DICHOTOMOUS  CRITERION  BY 
IDENTIFYING  THE  PREDICTED  SCORE  CUTTING  VALUE 
WHICH  PRODUCES  THE  SMALLEST  NUMBER  OF  MISCLASS1- 
FICATIONS) 

TCFFNT  - MATRIX  OF  T COEFFICIENTS  (FOR  A GIVEN  SET  OF 

VARIABLES,  THE  I-TH  COLUMN  CONTAINS  THE  REGRES- 
SION COEFFICIENTS  FOR  PREDICTING  VARIABLE  I FROM 
THE  FIRST  l l-l ) VARI ABLESI 


GROUPING  OR  CLUSTERING 


GROUP4  - GROUP  OBJECTS  ON  THE  BASIS  OF  AN  INPUT  MATRIX 
WHICH  CONTAINS  MEASURES  OF  SIMILARITY  OR  DIF- 
FERENCE BETWEEN  ALL  PAIRS  OF  OBJECTS 


FACTOR  ANALYSIS 


COMMUN  - COMMUNAL  I T IES  COMPUTATION  FOR  A FACTOR  LOADINGS 
MATRIX 

PRAXFA  - PRINCIPAL  AXIS  FACTOR  LOADINGS  COMPUTATION 
QTRANS  - SQUARE  ROOT  FACTOR  ANALYSIS  OF  A CORRELATION  MA- 
TRIX 

VARROT  - VARIMAX  ROTATION 


3.3.3.  DOCUMENTATION  REFERENCES 


WARD,  J.H.,  JR.,  BUCHHORN,  J.,  6 HALL,  K . INTRODUCTION  TO 
PERSUR.  PRL-TR-67-3  ( I ) , AD- A AO  S7fi.  LACKLAND  AF  B , TX’. 
PERSONNEL  RESEARCH  LABORATORY,  AEROSPACE  MEDICAL  DIVISION, 
AUGUST  I 9 A 7 , 


WARD,  J.H.,  JR.,  HALL,  K.,  & BUCHHORN,  J.  PERSU8  REFERENCE 
MANUAL.  PRL-TR-A7-3 ( I I I , AD-AAO  S79.  LACKLAND  A F B , T X I 
PERSONNEL  RESEARCH  LABORATORY,  AEROSPACE  MEDICAL  DIVISION, 
AUGUST  I 9 A 7 • 

THE  FIRST  REFERENCE  GIVEN  ABOVE  CONTAINS  FOUR  EXAMPLES  OF  THE 
APPLICATION  OF  THE  PERSUB  SYSTEM  TO  DATA  ANALYSIS  PROBLEMS.  THE 
SECOND  REFERENCE  CONTAINS  DESCRIPTIONS  OF  99  PERSUB  SUBROUTINES 
ALONG  WITH  A LISTING  OF  THE  SOURCE  LANGUAGE  STATEMENTS  FOR  EACH 
SUBROUTINE.  SUBROUTINES  ADDED  TO  PERSUB  AFTER  PUBLICATION  OF  THE 
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ABOVE  TECHNICAL  REPORTS  ARE  DESCRIBED  IN  THE  TWO  AFHRL  UNIVAC  1108 
DOC  PROCESSOR  REPORTS  GIVEN  BELOW.  THE  FIRST  CONTAINS  DESCRIPTIONS 
OF  HOMVAR,  MUTRUN,  RANDCS,  RANEXP,  RBTAFT,  RGAMMa,  AND  SECmNT.  THE 
SECOND  REPORT  DESCRIBES  AKRACT.  COMMUN,  COMPAR,  CORRLD,  HITS, 
PREOSC,  AND  TABDEV. 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED):  TITLE  - 
M I SC  . 

AFHRL  UNIVAC  1108  DOC  PROCESSOR  REPORT  (UNPUBLISHED)*.  TITLE  - 
PERSUB . 

MISC  REPORT  RETRIEVAL  COMMAND:  0T*T.DOC  MISC. 

PERSUB  REPORT  RETRIEVAL  COMMAND:  UT*T.OOC  PERSUB. 
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H.  NATIONALLY  recognized  statistical  PACKAGES 


the  term,  statistical  packag-,  as  applied  in  this  report 

REFERS  TO  A COLLECTION  OF  COMPUTER  PROGRAMS  WHICH  PERFORM  A 
VAHIETY  OF  STATISTICAL  PROCEDURES  AND  WHICH  SHARE  a COMMON  MAIN 
PROGRAM  FOR  HANDLING  INPUT  DATA  AND  FOR  INVOKING  THE  SPECIFIC 
PROCEDURES  OR  PROGRAMS  WHICH  HAVE  BEEN  REQUESTED.  BECAUSE  OF  THE 
ADVANTAGE  OF  THE  HIGH  SIMILARITY  IN  THE  CONVENTIONS  FOR  CONTROL 
CARO  PREPARATION  FOR  THE  VARIOUS  PROGRAMS  IN  A PACKAGE  AND  BECAUSE 
A VARIETY  OF  STATISTICAL  TECHNIQUES  IS  AVAILABLE  WITHIN  ONE 
PACKAGE,  The  USE  OF  STATISTICAL  PACKAGES  AS  OPPOSED  TO  COLLECTING 
SEPARATE  PR06RAMS  HAS  BECOME  WIDESPREAD  THROUGHOUT  THE  NATION'S 
EDUCATIONAL  AND  PROFESSIONAL  COMMUNITIES.  WITHIN  APPROXIMATELY  THE 
LAST  IS  YEARS,  SEVERAL  SUCH  STATISTICAL  P A C K A GE  5 HAVE  BEEN 
DEVELOPED  AND  DISTRIBUTED  NATIONALLY  (E.G.,  BMD  , SPSS.  SAS, 
STAT JOB ) . 

WHILE  PERSONNEL  RESEARCHERS  AT  A F HR  L HAVE  DEPENDED  PRIMARILY  ON 
MATHEMATICAL  AND  STATISTICAL  SOFTWARE  DEVELOPED  IN-HOUSE  AND  WHILE 
SPECIAL-PURPOSE  SOFTWARE  WILL  CONTINUE  TO  BE  DEVELOPED  AS  IT  IS 
NEEDED,  PERSONNEL  RESEARCHERS  SHOULD  BE  AWARE  OF  THE  TWO 
NATIONALLY  recognized  STATISTICAL  PACKAGES  AVAILABLE  IN  THE  AFHRL 
software  LIBRARY.  THESE  MAY  BE  OF  PARTICULAR  VALUE  TO  NEW, 
INCOMING  RESEARCHERS  WHO  HAVE  UNDERGONE  THEIR  TRAINING  AT 
INSTITUTIONS  WITH  THESE  PACKAGES  AND  TO  RESEARCHERS  WHO  WISH  TO 
COMMUNICATE  THE  RESULTS  OF  THEIR  RESEARCH  TO  ThE  EDUCATIONAL  AND 
PROFESSIONAL  COMMUNITIES  IN  TERMS  OF  COMPUTER  ANALYSES  WHICH  HAVE 
BEEN  ACCOMPLISHED  USING  A WIDELY  KNOWN  AND  USED  SOFTWARE  PACKAGE. 

THE  TWO  NATIONALLY  RECOGNIZED  STATISTICAL  PACKAGES  AVAILABLE 
IN  THE  AFHRL  MATHEMATICAL  AND  STATISTICAL  SOFTWARE  LIBRARY  ARE  (1) 
THE  BIOMEDICAL  COMPUTER  PROGRAMS  (BMD  AND  BMDP)  AND  I2)  THE 
STATJOB  SYSTEM,  VERSION  10  RELEASED  IN  APRIL  1976  FROM  THE  MADISON 
ACADEMIC  COMPUTING  CENTER  AT  THE  UNIVERSITY  OF  WISCONSIN.  BMD  AND 
STATJOB  WERE  OBTAINED  THROUGH  THE  USE  PROGRAM  LIBRARY  INTERCHANGE 
(UPLI)  OF  THE  UNIVAC  USERS  GROUP  USE,  INCORPORATED.  THE  FEATURES 
AND  SPECIFIC  CAPABILITIES  OF  EACH  OF  THESE  PACKAGES  WILL  BE 
DISCUSSED  IN  THE  SECTIONS  WHICH  FOLLOW. 
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M.l.  BIOMEDICAL  computer  programs 


9.1.1.  general  description 


THE  BIOMEDICAL  COMPUTER  PROGRAMS  COMPRISE  A SERIES  OF 
STATISTICAL  PROGRAMS  WHICH  HAVE  EVOLVED  AS  A RESULT  OF 

METHODOLOGICAL  RESEARCH  AND  PROGRAMMING  EFFORT  AT  THE  UCLA  HEALTH 
SCIENCES  COMPUTING  FACILITY  SINCE  19S7.  IT  IS  PERHAPS  THE  OLDEST 
AND  THE  MOST  WIDELY  USED  STATISTICAL  PACKAGE  AVAILABLE.  THE 
BIOMEDICAL  COMPUTER  PROGRAMS  WERE  ORIGINALLY  INTENDED  FOR 
RESEARCHERS  IN  THE  HEALTH  SCIENCES,  BUT  THE  PROGRAMS  HAVE  ALSO 
BEEN  USED  IN  A WIDE  RANGE  OF  BIOLOGICAL  AND  BEHAVIORAL 
APPLICATIONS.  ADDITIONALLY,  ThE  PROGRAMS  HAVE  BEEN  USED  FOR 

PROBLEMS  IN  THE  PHYSICAL  SCIENCES,  ALTHOUGH  THE  DEVELOPERS  DID  NOT 
DESIGN  THE  PROGRAMS  TO  SOLVE  THE  HIGHLY  ILL-CONDITIONED  PROBLEMS 
WHICH  ARE  SOMETIMES  ENCOUNTERED  THERE.  THE  BIOMEDICAL  COMPUTER 
PROGRAMS  WERE  DISTRIBUTED  UNDER  THE  NAME  BIMD  AS  EARLY  AS  1961. 
LATER,  THE  BIMD  PROGRAMS  WERE  CONSOLIDATED  AND  EXPANDED  AND  IN 

I 9 6 H , WERE  ISSUED  AS  BMD  PROGRAMS.  BY  196S,  BMD  PROGRAMS  WERE 

supplemented  by  the  bmdx  programs,  and  in  1969,  the  distribution 

OF  THE  B M 0 P PROGRAMS  BEGAN. 

THE  AFHRL  SOFTWARE  LIBRARY  CONTAINS  39  BMD  AND  BMDX  PROGRAMS 
WHICH  WERE  OBTAINED  THROUGH  UP  L 1 • THE  VERSION  DATES  FOR  THESE 
PROGRAMS  RANGE  FROM  MAY  1969  TO  FEBRUARY  1973  AND  CORRESPOND  TO 
The  PROGRAMS  DOCUMENTED  IN  THE  THREE  EDITIONS  of  USER'S  MANUALS 
WHICH  WERE  PUBLISHED  IN  1 9 6 B , 1 970  , AND  1 973.  THE  AFHRL  SOFTWARE 

LIbRARY  ALSO  HAS  THE  AUGUST  1 9 76  VERSION  OF  B M D P • IT  IS  COMPRISED 
OF  THE  30  PROGRAMS  WHICH  ARE  DOCUMENTED  IN  THE  1977  EDITION  OF  THE 

BMUP  users  manual. 

ACCORDING  TO  THE  UCLA  HEALTH  SCIENCES  COMPUTING  FACILITY,  THE 
BMUP  PROGRAMS  ARE  GENERALLY  PREFERRED  TO  THE  BMD  PROGRAMS  SINCE 

they  are  newer,  contain  improved  statistical  and  numerical 
methods,  AND  are  easier  to  use.  but  the  health  sciences  computing 
facility  continues  to  maintain  the  old  bmd  since  it  has  some 
advantages  and  SINCE  some  of  the  bmd  series  programs  do  not  have 
complete  replacements  in  the  bmdp  series  m.e.,  bmdiom,  bmdo3S, 

BMD02T,  B m 0 0 3 T , BMDOHT,  BMD07V,  BMD08V,  BMD10V,  BMD1IV,  BMD12V). 

for  more  information  on  the  use  of  bmd  in  conjunction  with  bmdp, 
the  interested  reader  is  referred  TO  -THE  BMD  and  bmdp  SERIES  of 
STATISTICAL  COMPUTER  PROGRAMS-  BY  JAMES  W.  FRANE  IN  THE  OCTOBER 
1976  ISSUE  OF  COMMUNICATIONS  OF  THE  ACM. 
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THE  BIOMEDICAL  COMPUTER  PROGRAMS  HAVE  STATISTICAL  TITLES  AND 
ARE  GROUPED  TOGETHER  ACCORDING  TO  THE  FOLLOWING  GENERAL 
STATISTICAL  CATEGORIES  OR  CLASSES.* 

CLASS  D - DATA  DESCRIPTION  AND  TABULATION;  DATA  IN 
GROUPS!  PLOTS  AND  HISTOGRAMS 
CLASS  F - FREQUENCY  TABLES 
CLASS  M - MULTIVARIATE  ANALYSIS 
CLASS  R - REGRESSION  ANALYSIS 
CLASS  S - SPECIAL  PROGRAMS 
CLASS  T - TIME  SERIES  ANALYSIS 
CLASS  V - VARIANCE  ANALYSIS. 

THE  FOLLOWING  SECTION  CONTAINS  A LIST  OF  THE  30  B M 0 P PROGRAMS 
AVAILABLE  IN  THE  SOFTWARE  LIBRARY  ALONG  WITH  THEIR  DESCRIPTIONS 
FROM  THE  0MDP-77  MANUAL.  IN  EACH  SECTION  FOLLOWING  THE  BMOP 
PROGRAM  DESCRIPTIONS  ARE  THE  DESCRIPTIONS  OF  THE  SOFTWARE 
LIBRARY'S  HMD  AND  B MD  X PROGRAMS.  (NOTE.  THE  LETTER  "P"  FOLLOWS 
THE  ACRONYM  BMD  IN  THE  PROGRAM  NAME  FOR  "P"  SERIES  PROGRAMS,  WHILE 
THE  LETTER  "X"  IS  USED  FOR  THE  "X"  SERIES  PROGRAMS.  ALSO,  THE  LAST 
LETTER  IN  THE  PROGRAM  NAMES  BELOW  CORRESPOND  TO  THE  STATISTICAL 
CATEGORIES  TO  WHICH  THE  PROGRAMS  BELONG.) 
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R. 1 .2. 

CLASS  D 

BMDP  1 D 


BMDP2D 


BMDP3D 


LIST  OF  PROGRAMS  AND  PROGRAM  DESCRIPTIONS 

- DATA  DESCRIPTION  AND  TABULATION!  DATA  IN  GROUPS! 
PLOTS  AND  HISTOGRAMS 


SIMPLE  DATA  DESCRIPTION 

COMPUTES  UNIVARIATE  STATISTICS,  LISTS  ALL  DATA  OR 
LISTS  SELECTED  CASES  (THOSE  CONTAINING  MISSING 
VALUES  OR  VALUES  OUTSIDE  MINIMUM  OR  MAXIMUM 
LIMITS)  AND  STORES  THE  DATA  IN  A BMDP  FILE.  FOR 
EACH  VARIABLE,  THE  OUTPUT  INCLUDES  MEAN,  STANDARD 
DEVIATION,  STANDARD  ERROR  OF  MEAN,  COEFFICIENT  OF 
VARIATION,  LARGEST  AND  SMALLEST  VALUES,  LARGEST 
AND  SMALLEST  STANDARDIZED  SCORES  (Z-SCORES), 

RANGE,  AND  THE  TOTAL  NUMBER  OF  ACCEPTABLE  VALUES* 
CODES  (VALUES)  OR  INTERVALS  (CATEGORIES)  FOR  A 
VARIABLE  CAN  BE  SPECIFIED  AND  FOR  EACH  SUCH  VARI- 
ABLE P1D  PRINTS  THE  NUMBER  OF  CASES  WITH  EACH  CODE 
OR  IN  EACH  INTERVAL. 

DETAILED  DATA  DESCRIPTION 

COUNTS  AND  LISTS  DISTINCT  VALUES  OF  EACH  VARIABLE 
IN  ASCENDING  ORDER,  AND  COMPUTES  UNIVARIATE 
STATISTICS.  VALUES  OF  VARIABLES  MAY  BE  TRUNCATED 
OR  ROUNDED  BY  DIFFERENT  AMOUNTS  FOR  DIFFERENT 
VARIABLES,  ALL  COMPUTATIONS  ARE  PERFORMED  ON  THE 
TRUNCATED  OR  ROUNDED  VALUES.  FOR  EACH  VARIABLE, 

THE  PROGRAM  PRINTS  THE  NUMBER  OF  ACCEPTABLE 
VALUES,  THE  MAXIMUM,  MINIMUM,  RANGE,  HALF  OF  THE 
INTERQUARTILE  RANGE,  MEAN,  MEDIAN,  MODE,  STANDARD 
DEVIATION,  STANDARD  ERROR  OF  THE  MEAN,  SKEWNESS 
AND  KURTOSIS.  THE  COMPUTED  STATISTICS  ARE  PRINTED 
ON  A LINE  PLOT  (130  CHARACTERS  LONG)!  A HISTOGRAM 
IS  ALSO  PRINTED.  FOR  EACH  DISTINCT  VALUE,  THE 
NUMBER  OF  CASES  WITH  EACH  VALUE,  THE  PERCENT  OF  THE 
CASES  COUNTED  THAT  THIS  NUMBER  REPRESENTS  AND  THE 
CUMULATIVE  PERCENT  OF  CASES  WITH  VALUES  LESS  THAN 
OR  EQUAL  TO  EACH  DISTINCT  VALUE  ARE  PRINTED. 
OPTIONAL  OUTPUT  INCLUDES  THREE  NEW  LOCATION 
EST IMATES. 

COMPARISON  OF  TWO  GROUPS  WITH  T TESTS 
COMPUTES  ONE-SAMPLE  AND  TWO-SAMPLE  T TESTS  AND 
THEIR  ASSOCIATED  PROBABILITY  LEVELS.  CRUDE  HISTO- 
GRAMS, SHOWING  CASE  DISTRIBUTIONS,  ARE  PRESENTED 
WITH  EACH  TEST.  A GROUPING  VARIABLE  THAT  CLASSI- 
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bmdprd 


BMDP5D 


BMDP6D 


FIES  THE  CASE5  INTO  GROUPS  CAN  BE  SPECIFIED.  IF 
THERE  ARE  MORE  THAN  TWO  GROUPS,  THE  PROGRAM  COM- 
PUTES TWO-SAMPLE  T TESTS  FOR  EACH  PaIR  OF  GROUPS! 
TESTS  ARE  COMPUTED  FOR  BOTH  POOLED  AND  SEPARATE 
VARIANCE  ESTIMATES.  EQUALITY  OF  GROUP  VARIANCES 
IS  TESTED.  IF  SEVERAL  VARIABLES  ARE  ANALYZED, 
MAHALANOB1S*  O-SQUARE  AND  HOTELLlNG*S  T-SQUARE  CAN 
BE  REQUESTED.  A ONE-SAMPLE  T TEST  C H < Q I I MU  ■ 03 
IS  COMPUTED  WHEN  THERE  IS  NO  GROUPING  VARIABLE.  A 
ONE-SAMPLE  T TEST  C H < 0 > I MU  * MU101D  OR  A MATCHED 
PAIR  T TEST  MAY  BE  PERFORMED.  THE  OUTPUT  INCLUDES 
MEANS,  STANDARD  DEVIATIONS,  STANDARD  ERROR 
OF  MEANS,  MAXIMUMS  AND  MINIMUMS  FOR  EACH  GROUP. 

AN  F VALUE  FOR  COMPARISON  OF  VARIANCES,  POOLED  AND 
SEPARATE  T VALUES,  AND  TWO-TAILED  PROBABILITY 
VALUE5  FOR  EACH  T AND  F ARE  ALSO  PRINTED. 

SINGLE  COLUMN  FREQUENCIES 

COUNTS  THE  NUMERIC  AND  NON-NUMERIC  CHARACTERS  (SYM- 
BOLS) FOUND  IN  SINGLE  COLUMN  FIELDS.  THE  PROGRAM 
ASSUMES  THAT  A SINGLE  COLUMN  FIELD  IS  READ  FOR  EACH 
VARIABLE  IN  AI  FORMAT.  ALL  KEYPUNCH  CHARACTERS  ARE 
CONSIDERED  LEGAL,  AND  THE  FREQUENCY  OF  EACH  IS  COUN- 
TED SEPARATELY.  P 4 D CAN  BE  USED  TO  LIST  THE  DATA 
AFTER  CERTAIN  CHARACTERS  ARE  REPLACED  BY  SPECIFIED 
SYMBOLS  (E.G.,  NUMBERS  REPLACED  BY  BLANKS).  THIS  IS 
USEFUL  FOR  PRELIMINARY  DATA  SCREENING,  VERIFYING 
APPROPRIATE  CODING,  AND  VERIFYING  (AT  LEAST 
ROUGHLY)  THE  KINDS  OF  DATA  PRESENT. 

HISTOGRAMS  AND  UNIVARIATE  PLOTS 

PRINTS  HISTOGRAMS  AND  OTHER  UNIVARIATE  PLOTS.  FOR 
EACH  PLOT,  CASES  BELONGING  TO  ONE  OR  MORE  GROUPS 
CAN  BE  USED.  IN  THE  PLOTS,  CASES  FROM  DIFFERENT 
GROUPS  CAN  BE  IDENTIFIED  BY  DISTINCT  LETTERS. 

PLOTS  CAN  BE  PRINTED  FOR  ALL  GROUPS  IN  ONE  PLOT  OR 
FOR  EACH  GROUP  INDIVIDUALLY.  THE  SIZE  OF  THE 
PLOTS  CAN  BE  SPECIFIED.  ONE  OR  MORE  OF  THE 
FOLLOWING  PLOTS  ARE  PRINTED  FOR  A VARIABLE!  A 
HISTOGRAM  OR  A CUMULATIVE  HISTOGRAM  WITH  FREQUEN- 
CIES AND  PERCENTAGES  IN  EACH  INTERVAL,  A NORMAL 
PROBABILITY  PLOT,  A HALF-NORMAL  PROBABILITY  PLOT, 

OR  A CUMULATIVE  DISTRIBUTION  PLOT.  THE  TOTAL 
FREQUENCY  COUNT,  MEAN  AND  STANOARD  DEVIATION  (FOR 
EACH  GROUP  IN  A PLOT)  ARE  PRINTED. 

BIVARIATE  (SCATTER)  PLOTS 

PRINTS  BIVARIATE  SCATTER  PLOTS.  CASES  BELONGING 
TO  ONE  OR  MORE  GROUPS  CAN  BE  USED  FOR  EACH  PLOT. 

THE  PLOTS  MAY  HAVE  FREQUENCIES  PLOTTED  FOR  EACH 
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BMDP7D 


BMDP8D 


POINT,  OR  DISTINCT  LETTERS  MAY  BE  USED  TO  INDICATE 
GROUP  MEMBERSHIP.  PLOTS  CAN  BE  PRINTED  FOR  ALL 
GROUPS  IN  ONE  PLOT  OR  FOR  EACH  GROUP  IN  SEPARATE 
PLOTS.  PLOTS  CAN  BE  SUPERIMPOSED  ONE  ON  THE 
OTHER.  THE  SIZE  OF  THE  PLOTS  CAN  BE  CONTROLLED 
AND  THE  REGRESSION  EQUATIONS  FOR  REGRESSING  X ON  Y 
AND  Y ON  X CAN  BE  REQUESTED!  INTERSECTIONS  OF  THE 
REGRESSION  LINES  WITH  THE  PLOT  FRAME  ARE  INDI- 
CATED. OPTIONAL  STATISTICS  PRINTED  FOR  EACH  PLOT 
(FOR  CASES  USED  IN  THE  PLOT)  ARE  MEAN  AND  STANDARD 
DEVIATION  FOR  EACH  VARIABLE,  AND  EQUATION  OF  SIM- 
PLE LINEAR  REGRESSION  FOR  EACH  VARIABLE  ON  THE 
OTHER  VARIABLE  WITH  THEIR  RESIDUAL  MEAN  SQUARE, 
CORRELATION,  AND  FREQUENCY  OF  CASES. 

DESCRIPTION  OF  GROUPS  (STRATA)  WITH  HISTOGRAMS  AND 
ANALYSIS  OF  VARIANCE 

GROUPS  THE  DATA  INTO  A SPECIFIED  NUMBER  OF  GROUPS 
BASED  ON  THE  VALUE  OF  A GROUPING  VARIABLE.  FOR 
EACH  VARIABLE,  THE  PROGRAM  PRINTS  HISTOGRAMS  FOR 
EACH  GROUP,  SIDE  BY  SIDE.  THE  NUMBER  OF  INTERVALS 
IN  THE  HISTOGRAMS  CAN  BE  SPECIFIED  OR  COMPUTED  BY 
THE  PROGRAM.  MEAN,  STANDARD  DEVIATION,  AND  FRE- 
QUENCIES ARE  REPORTED  FOR  EACH  GROUP  AND  FOR  ALL 
GROUPS  COMBINED.  A ONE-WAY  OR  TWO-WAY  ANALYSIS  OF 
VARIANCE  IS  COMPUTED.  SELECTED  VALUES  FOR  EACH 
VARIABLE  CAN  BE  EXCLUDED  FROM  THE  COMPUTATIONS  BY 
SPECIFYING  MISSING  VALUE  CODES,  MAXIMUMS  OR  M I N - 

IMUMS!  however,  these  values  are  displayed  sepa- 
rately IN  THE  HISTOGRAM.  OPTIONAL  OUTPUT  INCLUDES 
A LISTING  OF  INPUT  DATA  AFTER  TRANSFORMATIONS  OR 
AFTER  ORDERING  FROM  LOW  TO  HIGH  ON  A SPECIFIED 
VARIABLE!  CORRELATIONS  FOR  EACH  GROUP  SEPARATELY 
AND  FOR  ALL  GROUPS  COMBINED!  AND  WlNSORIZED  MEANS 
AND  9S*  CONFIDENCE  INTERVALS  FOR  THE  WlNSORIZED 
MEANS  FOR  EACH  GROUP. 

MISSING  VALUE  CORRELATION 

COMPUTES  CORRELATIONS  FOUR  DIFFERENT  WAYS  FROM 
DATA  CONTAINING  MISSING  VALUES.  THESE  COMPUTA- 
TIONS USE  DIFFERENT  CASES  IN  COMPUTING  MEANS  AND 
VARIANCES.  COMPUTATIONS  CAN  BE  PERFORMED  USING 
ALL  ACCEPTABLE  VALUES  --  THE  MEANS  ARE  COMPUTED 
FROM  ALL  ACCEPTABLE  VALUES  AND  THEN  DEVIATIONS 
FROM  THESE  MEANS  ARE  USED  TO  COMPUTE  COVARIANCES 
AND  CORRELATIONS!  ALL  ACCEPTABLE  PAIRS  OF  VALUES 
FOR  COVARIANCES  --  EACH  ELEMENT  OF  THE  COVARIANCE 
MATRIX  IS  COMPUTED  FROM  THE  EXISTING  PAIRS  OF 
VALUES  INVOLVED,  WHICH  ARE  THEN  USED  TO  COMPUTE 
THE  CORRELATIONS!  ALL  ACCEPTABLE  PAIRS  OF  VALUES 


MATHEMATICAL  and  statistical  software 


I NDE  X 


BMDP9D 


BMDX70 
(OR  BMDI3D) 


BMD  X 8 4 
(OR  BMD 1 2D  I 


FOR  CORRELATIONS  --  EACH  ELEMENT  OF  THE  CORREL- 
ATION MATRIX  IS  COMPUTED  FROM  THE  ACCEPTABLE  PAIRS 
OF  VALUES  involved:  OR  ALL  COMPLETE  CASES  -- 
CASES  WITH  ANY  EXCLUDED  CASES  ARE  NOT  USED.  ANY 
OR  ALL  OF  THE  FIRST  THREE  TYPES  OR  THE  FOURTH  TYPE 
CAN  BE  REQUESTED  FOR  EACH  PROBLEM.  CASE  WEIGHTS 
MAY  ALSO  BE  SPECIFIED.  THE  CORRELATION  MATRIX  CAN 
BE  SAVED  IN  A BMDP  FILE  AND  USED  AS  INPUT  TO  OTHER 
PROGRAMS.  P 8 D CAN  ALSO  BE  USED  TO  OBTAIN  A SUB- 
MATRIX OF  THE  CORRELATION  MATRIX.  THE  OUTPUT  CAN 
INCLUDE  THE  MEAN  AND  VARIANCE  OF  EACH  VARIABLE,  A 
FREQUENCY  TABLE  OF  ACCEPTABLE  VALUES  FOR  PAIRS  OF 
VARIABLES,  THE  COVARIANCE  MATRIX,  THE  CORRELATION 
MATRIX,  THE  SUM  OF  WEIGHTS,  THE  MATRIX  OF  MEANS 
AND  THE  MATRIX  OF  VARIANCES. 

MULTIWAY  DESCRIPTION  OF  GROUPS 

PROVIDES  CELLWISE  DESCRIPTIVE  STATISTICS  FOR  CASES 
WHEN  THE  DATA  ARE  CLASSIFIED  SIMULTANEOUSLY  BY 
SEVERAL  GROUPING  VARIABLES.  THE  PROGRAM  GIVES 
CELL  FREQUENCIES,  MEANS  AND  STANDARD  DEVIATIONS 
FOR  EACH  VARIABLE  DESCRIBED.  IF  REQUESTED  , THE 
PROGRAM  YIELDS  THE  SAME  INFORMATION  FOR  SPECIFIED 
MARGINAL  CELLS.  P9D  COMPUTES  A CHI-SQUARE  TEST 
FOR  EQUALITY  OF  ALL  CELL  FREQUENCIES,  AN  F TEST 
FOR  EQUALITY  OF  ALL  CELL  MEANS  (USING  ONLY  NONEMP- 
TY CELLS)  AND  BARTLETT  * S TEST  FOR  HOMOGENEITY  OF 
VARIANCES  (USING  ALL  CELLS  WITH  NONZERO  VARIANCE). 
FOR  EACH  VARIABLE  DESCRIBED,  THE  PROGRAM  ALSO 
PRINTS  A PROFILE  PLOT  INDICATING  HOW  CELL  MEANS 
SHIFT  FROM  CELL  TO  CELL.  IN  THESE  PLOTS  THE 

numbers  printed  in  the  plot  indicate  cell  fre- 
quencies; THE  LOCATION  OF  THE  NUMBERS  INDICATES 
THE  DEVIATION  OF  CELL  MEANS  FROM  THE  GRAND  MEAN 
IN  STANDARD  DEVIATION  UNITS. 


T PROGRAM 

COMPUTES  T-STATISTICS  AND  ASSOCIATED  PROBABILITY 
LEVELS  FOR  THE  EQUALITY  OF  THE  MEANS  OF  TWO 
GROUPS  BASED  ON  POOLED  AND  SEPARATE  VARIANCE  ES- 
TIMATES. AN  F-STATISTIC  AND  ASSOCIATED  PROBA- 
BILITY LEVEL  FOR  THE  EQUALITY  OF  GROUP  VARIANCES 
ARE  ALSO  COMPUTED. 

ASYMMETRIC  CORRELATION  WITH  MISSING  DATA 
COMPUTES  LARGE  CORRELATION  MATRICES  OR  PORTIONS 
OF  LARGE  CORRELATION  MATRICES  FROM  DATA  WITH 
MISSING  VALUES,  USING  ALL  POSSIBLE  COMPLETE  PAIRS 
WITH  BLANKS  DENOTING  MISSING  DATA. 
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- FREQUENCY  TABLES 


TWO-WAY  FREQUENCY  TABLES 

PRINTS  A VARIETY  OF  TwO-WAY  TABLES  AND  PROVIDES 
ESTIMATES  OF  MANY  MEASURES  OF  ASSOCIATION  AND 
PREDICTION.  THE  TABLES  CAN  CONTAIN  OBSERVED  FRE- 
QUENCIES! EXPECTED  VALUES  UNDER  INDEPENDENCE! 
PERCENTAGES  OF  THE  TABLE  TOTAL,  OF  EACH  ROW  TOTAL 
OR  OF  EACH  COLUMN  TOTAL!  OR  SEVERAL  TYPES  OF 
RESIDUALS.  TESTS  AND  STATISTICS  THAT  CAN  BE 
COMPUTED  FOR  EACH  TWO-WAY  TABLE  ARE!  TESTS  OF 
INDEPENDENCE  AND  RELATED  MEASURES  (CHI-SQUARE 
TEST,  THE  LIKELIHOOD  RATIO  CHI-SQUARE  TEST  G- 
SQUARE,  PHI,  THE  CONTINGENCY  COEFFICIENT  AND 
CRAMER’S  V)!  TESTS  OF  INDEPENDENCE  APPROPRIATE 
ONLY  FOR  2X2  TABLES  (FISHER’S  EXACT  TEST  AND 
YATE'S  CORRECTED  CHI-SQUARE)!  MEASURES  OF  ASSOCI- 
ATION APPROPRIATE  FOR  A 2X2  TABLE  (YULE’S  Q AND  Y, 
THE  CROSS-PRODUCT  RATIO  AND  THE  TETRACHOR1C  COR- 
RELATION R ( T ) ) ! MEASURES  OF  ASSOCIATION  AND  COR- 
RELATION WHEN  THE  CATEGORIES  OF  BOTH  FACTORS  ARE 
ORDERED  (GOODMAN  AND  KRUSKAL  GAMMA,  KENDALL’S 
T A U ( 8 ) , STUART’S  TAU(C),  THE  P R 0 D U C T - M 0 M E N T COR- 
RELATION R AND  THE  SPEARMAN  RANK  CORRELATION)! 
PREDICTIVE  MEASURES  WHEN  THE  CATEGORIES  OF  BOTH 
FACTORS  ARE  ORDERED  (SOMERS'  D ) ! PREDICTIVE 
MEASURES  WHEN  EITHER  OR  BOTH  FACTORS  ARE  QUALI- 
TATIVE (THE  GOODMAN  AND  KRUSKAL  TAU,  LAMBDA  AND 
LAMBDA!*)  AND  THE  UNCERTAINTY  COEFFICIENT)! 
MCNEMAR’S  TEST  OF  SYMMETRY!  AND  A TEST  OF  A 
CONTRAST  ON  THE  PROPORTIONS  IN  A 2 X K TABLE. 

STANDARD  ERRORS  ARE  PROVIDED  FOR  MANY  OF  THE 
HEA5URES  OF  ASSOCIATION  AND  PREDICTION. 

TWO-WAY  FREQUENCY  TABLES  --  EMPTY  CELLS  AND 
DEPARTURES  FROM  INDEPENDENCE 

TREATS  TWO  SPECIAL  PROBLEMS!  FIRST,  CELLS  CAN  BE 
EXCLUDED  FROM  THE  FREQUENCY  TABLE,  SUCH  AS  CELLS 
WITH  A PRIOR  PROBABILITY  OF  ZERO  (STRUCTURAL 
ZEROES).  P 2 F COMPUTES  THE  APPROPRIATE  TEST  OF 
INDEPENDENCE  BETWEEN  ROWS  AND  COLUMNS  (THE  CHI- 
SQUARE  TEST  OF  QU AS  l - I NDEPENDENCE  I . THIS  TEST 
USES  ALL  CELLS  THAT  ARE  NOT  EXCLUDED  A PRIORI  AND 
IS  IDENTICAL  TO  THE  USUAL  CHI-SQUARE  TEST  WHEN 
THERE  ARE  NO  EXCLUDED  CELLS.  SECOND,  CELLS  OR 
PATTERNS  OF  CELLS  CAN  BE  IDENTIFIED  THAT  CONTRIBUTE 
TO  A DEPARTURE  FROM  INDEPENDENCE  or  quasi-inde- 
pendence: I.E.,  CONTRIBUTE  GREATLY  TO  A SIGNIFI- 
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CANT  CHI-SQUARE  TEST.  A CRITERION  CAN  BE  SELECTED 
BY  WHICH  CELLS  ARE  IDENTIFIED  AND  ELIMINATED 
(TREATED  AS  EXCLUDED)  FROM  THE  TABLE  IN  A STEPWISE 
MANNER.  ONE  SUCH  CRITERION  IS  TO  ELIMINATE  THE 
CELL  THAT,  WHEN  EXCLUDED,  MINIMIZES  THE  CHI-SQUARE 
(OR  LIKELIHOOD  RATIO  CHI-SQUARE)  TEST  OF  QUASI- 
INDEPENDENCE PERFORMED  ON  ALL  CELLS  NOT  PREVIOUSLY 
EXCLUDED.  INFERENCES  CAN  BE  DRAWN  FROM  THE  DATA 
BY  EXAMINING  THE  PATTERN  OF  CELLS  ELIMINATED  BY 
THIS  PROCEDURE.  P 2F  COMPUTES  ONLY  THE  CHI-SQUARE 
TESTS  OF  INDEPENDENCE  (OR  Q U A S I - 1 N D E P E ND E N C E ) , 
EXPECTED  VALUES  ASSUMING  INDEPENDENCE,  AND  VARIOUS 
TYPES  OF  RESIDUALS.  IT  DOES  NOT  PROVIDE  ESTIMATES 
OF  THE  MEASURES  OF  ASSOCIATION  ANQ  PREDICTION  THAT 
ARE  AVAILABLE  IN  P 1 F . 

MULTIWAY  FREQUENCY  TABLES 

ANALYZES  DATA  IN  A MULTIWAY  TABLE.  THE  PURPOSE  OF 
THE  ANALYSIS  IS  TO  OBTAIN  A DESCRIPTION  OF  THE 
RELATIONSHIPS  BETWEEN  THE  FACTORS  OF  THE  TABLE, 
EITHER  BY  FORMING  A MODEL  FOR  THE  DATA  OR  BY 
TESTING  AND  ORDERING  THE  IMPORTANCE  OF  THE  INTER- 
ACTIONS BETWEEN  THE  FACTORS.  THE  ANALYSIS  IS 
BASED  ON  FITTING  A (HIERARCHICAL)  LOG-LINEAR  MODEL 
TO  THE  CELL  FREQUENCIES;  THAT  IS,  THE  LOGARITHM 
OF  THE  EXPECTED  CELL  FREQUENCY  IS  WRITTEN  AS  AN 
ADDITIVE  FUNCTION  OF  MAIN  EFFECTS  AND  INTERACTIONS 
IN  A MANNER  SIMILAR  TO  THE  USUAL  ANALYSIS  OF  VARI- 
ANCE MODEL.  IT  SHOULD  BE  NOTED  THAT  THE  VARIOUS 
TESTS  FOR  THE  INTERACTIONS  ARE  NOT  INDEPENDENT. 

IF  A TWO-WAY  TABLE  WERE  INPUT  TO  P3F,  ONE  POSSIBLE 
LOG-LINEAR  MODEL  WOULD  INCLUDE  TWO  MAIN  EFFECTS 
AND  A SINGLE  INTERACTION.  THE  TEST  THAT  THE 
INTERACTION  IS  ZERO  IS  IDENTICAL  TO  THE  TEST  FOR 
INDEPENDENCE  OBTAINED  IN  P1F.  P3F  CAN  BE  USED  TO 
FIND  AN  APPROPRIATE  MODEL  FOR  THE  DATA  IN  THE 
TABLE  BY  FITTING  SPECIFIED  MODELS  OR  BY  USING  A 
METHOD  TO  SCREEN  EFFECTS.  THE  PROGRAM  TESTS  THE 
APPROPRIATENESS  OF  MODELS  BY  THE  LIKELIHOOD  RATIO 
CHI-SQUARE  (G-SQUARE)  AND  BY  THE  USUAL  CHI-SQUARE 
GOODNESS-OF-F I T . P3F  PRINTS  THE  TABLE  OF  OBSERVED 
FREQUENCIES  AND  CAN  PRINT  MARGINAL  TABLES  OF  THE 
OBSERVED  FREQUENCIES  AS  WELL.  IT  CAN  TEST  CERTAIN 
CLASSES  OF  MODELS  OR  SPECIFIED  MODELS.  FOR  EACH 
SPECIFIED  MODEL  IT  CAN  PRINT  THE  EXPECTED  VALUES 
AND  TWO  TYPES  OF  RESIDUALS,  ESTIMATES  OF  THE 
PARAMETERS  OF  THE  MODEL  AND  THEIR  STANDARD  ERRORS, 
AND  TESTS-OF-FIT  OF  MODELS  THAT  DIFFER  FROM  THE 
SPECIFIED  MODEL  BY  ONE  EFFECT. 
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MULT ivariate  analysis 


CLUSTER  ANALYSIS  OF  VARIABLES 

CLUSTERS  SIMILAR  VARIABLES  BY  USING  AN  INITIAL 
MEASURE  OF  ASSOCIATION  BETWEEN  PAIRS  OF  VARIABLES 
(E.G.,  CORRELATION)  TO  FORM  A CLUSTER  OF  THE  TWO 
MOST  SIMILAR  VARIABLES  AND  THEN  USING  AN  AMALGAMA- 
TION (LINKAGE)  RULE  TO  FORM  FURTHER  CLUSTERS.  THE 
AMALGATION  RULE  DETERMINES  THE  DEGREE  OF  ASSOCIA- 
TION BETWEEN  ANY  TWO  clusters  where  a cluster 
CONSISTS  OF  ONE  OR  MORE  VARIABLES.  WHEN  THE  CLUS- 
TERING PROCESS  IS  FINISHED.  EACH  CLUSTER  CONSISTS 
OF  TWO  OR  MORE  VARIABLES  AND  EACH  VARIABLE  IS 
PLACED  IN  ONE  OR  MORE  CLUSTERS.  BOTH  THE  MEASURE 
OF  ASSOCIATION  FOR  THE  VARIABLES  AND  THE  AMALGAMA- 
TION RULE  FOR  THE  CLU5TERS  MAY  BE  SPECIFIED.  P1M 
ACCEPTS  A DATA  MATRIX  OR  A DISTANCE  MATRIX  (MEA- 
SURE OF  ASSOCIATION)  AS  INPUT.  OUTPUT  INCLUDES  A 
SUMMARY  TABLE  FOR  THE  CLUSTERING  PROCESS,  A TREE 
DIAGRAM  OF  CLUSTERS  SUPERIMPOSED  OVER  A SIMILARITY 
OR  DISTANCE  MATRIX  SCALED  FROM  0 TO  100,  A TABLE 
OF  SIMILARITY  OR  DISTANCE  MATRIX  SCALING,  AND  AN 
EXPLANATION  OF  THE  TREE  DIAGRAM  FOR  THE  FIRST 
PROBLEM.  THE  CORRELATION  MATRIX  CAN  ALSO  BE  REPRE- 
SENTED IN  SHADED  FORM  AFTER  REARRANGING  THE  ORDER 
OF  THE  VARIABLES  ACCORDING  TO  THE  CLUSTERS. 

CLUSTER  ANALYSIS  OF  CASES 

CLUSTERS  CASES  ACCORDING  TO  ONE  OF  FOUR  AVAILABLE 
DISTANCE  MEASURES.  THE  TWO  CASES  HAVING  THE  SHORT- 
EST DISTANCE  BETWEEN  THEM  ARE  AMALGAMATED  AND 
TREATED  AS  ONE  CASE  AND  THEN,  IN  TURN,  CLUSTERED 
WITH  OTHERS.  THIS  ALGORITHM  CONTINUES  UNTIL  ALL 
CASES  AND  CLUSTERS  ARE  AMALGAMATED  INTO  ONE  CLUS- 
TER. THE  DISTANCE  BETWEEN  CASES  IS  EITHER  THE  P-TH 
ROOT  OF  THE  SUMS  OF  THE  P-TH  POWERS  OF  DIFFERENCES 
(WHEN  P = 2 THIS  IS  THE  EUCLIDEAN  DISTANCE)  OR  CHI- 
SQUARE  OR  PHI-SQUARE  ( *CH I -SQUARE/N  ) WHERE  CHI- 
SQUARE  IS  COMPUTED  FROM  THE  TABLE  FORMED  BY  THE 
TWO  CASES  AND  THE  VARIABLES  (THE  LAST  TWO  DISTANCE 
MEASURES  ARE  USEFUL  WHEN  THE  DATA  ARE  COUNTS). 

A DIAGRAM  DRAWN  WITH  VERTICAL  LINES  TO  INDICATE 
CLUSTERING  OF  THE  CASES  IS  PRINTED,  AND  THE  ORDER 
OF  clustering  is  INDICATED  IN  THE  DIAGRAM,  option- 
al OUTPUT  INCLUDES  THE  INPUT  DATA  MATRIX  AFTER 
STANDARDIZATION,  THE  INITIAL  DISTANCE  MATRIX  BE- 
TWEEN CASES,  AND  A HORIZONTAL  CLUSTERING  TREE  D I A - 


- 
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gram,  the  distance  MATRIX  can  be  printed  in  shaded 

FORM  AFTER  REARRANGING  THC  CASES  ACCORDING  TO  THE 
CLUSTERS. 

BLOCK  CLUSTERING 

SIMULTANEOUSLY  CLUSTERS  CASES  AND  VARIABLES  OF  A 
DATA  MATRIX.  BLOCKS  (SUBMATRICES  OF  THE  DATA  MA- 
TRIX) ARE  IDENTIFIED  WHILE  THE  MARGINAL  ROWS  OF  A 
BLOCK  FORM  A CLUSTER  OF  CASES,  AND  THE  MARGINAL 
COLUMNS  OF  A BLOCK  FORM  A CLUSTER  OF  VARIABLES. 

THREE  TYPES  OF  CLUSTERS  ARE  THUS  DISCOVERED: 

CLUSTERS  OF  DATA  VALUES  (THE  BLOCKS),  CLUSTERS  OF 
CASES,  AND  CLUSTERS  OF  VARIABLES.  A BLOCK  DIAGRAM 
COMPLETELY  REPRESENTS  THE  OATA,  WITH  RELATIVELY 
FEW  PRINTED  SYMBOLS.  IN  THE  BLOCK,  THE  ROWS 
(CASES)  AND  COLUMNS  (VARIABLES!  OF  DATA  HAVE  BEEN 
PERMUTED,  AND  SMALLER  BLOCKS  (OR  SUBMATRICES)  OF 
SIMILAR  VALUES  ARE  OUTLINED.  SIMILAR  VALUES  ARE 
LEFT  BLANK  TO  GIVE  A GOOD  VISUAL  REPRESENTATION  OF 
LIKE  VALUES.  TO  PREVENT  OVERLAPPING,  THE  BLOCKS 
FORM  A HIERARCHY  (THAT  IS,  ANY  PAIR  OF  BLOCKS  ARE 
DISJOINT,  OR  ONE  INCLUDES  THE  OTHER).  AT  AN  EARLY 
STAGE  OF  DATA  ANALYSIS,  THE  PROGRAM  CAN  BE  USED  AS 
A MULTIVARIATE  HISTOGRAM.  IN  0 NE - D l ME N5 I ON  A L HIS- 
TOGRAMS REAL-VALUEO  VARIABLES  ARE  SEPARATED  INTO 
CATEGORIES  OF  EQUAL  LENGTH.  IN  A SENSE,  P3M  DOES 
THIS  TOO.  IT  SEPARATES  THE  VALUES  OF  EACH  VARIABLE 
INTO  A MAXIMUM  OF  3S  INTERVALS  AND  CODES  THEM 
I . 2 , . . . , 9 , A , B , . • , Z . FOR  EACH  VARIABLE  THE  NUMBER 
OF  CATEGORIES,  OR  INTERVALS  USED,  CaN  BE  SPECI- 
FIED. THE  OUTPUT  REPORTS  THE  NUMBER  OF  TIMES  EACH 
VARIABLE  TAKES  EACH  OF  ITS  CODED  VALUES.  TWO  TREES 
ARE  PRINTED,  ONE  REPORTING  THE  JOINING  SEQUENCE 
FOR  CASES  (ROW  CLUSTERS)  AND  THE  OTHER  REPORTING 
THE  JOINING  SEQUENCE  FOR  VARIABLES  (COLUMN  CLUS- 
TERS). ALSO  INCLUDED  IS  A BLOCK  DIAGRAM  ON  THE 
PERMUTED  DATA  MATRIX,  IN  WHICH  A LEADER  VALUE  AND 
EXCEPTIONS  ARE  PRINTED  FOR  EACH  BLOCK.  VALUES 
SIMILAR  TO  THE  LEADER  ARE  BLANKED  OUT. 

FACTOR  ANALYSIS 

PERFORMS  A factor  ANALYSIS  OF  EITHER  A CORRELATION 
OR  A COVARIANCE  MATRIX.  THE  INPUT  CAN  BE  DATA, 
CORRELATION  MATRIX,  COVARIANCE  MATRIX,  FACTOR  LOAD- 
INGS OR  factor  score  coefficients,  several  INITIAL 

COMMUNALITY  estimates  can  BE  USED,  INCLUDING  SQUARED 
MULTIPLE  CORRELATIONS,  MAXIMUM  ROW  VALUES  AND  VALUES 
SPECIFIED  BY  THE  USER.  INITIAL  FACTOR  EXTRACTION  CAN 
BE  OBTAINED  BY  PRINCIPAL  COMPONENTS,  MAXIMUM  LlKELI- 
HOOO , KAISER'S  SECOND  GENERATION  LITTLE  JIFFY,  OR 


56 


MATHEMATICAL  and  statistical  software  index 


BMDP6M 


BMDP7M 


ITERATED  PRINCIPAL  FACTOR  ANALYSIS.  SEVERAL  METHODS 
OF  ROTATION  ARE  AVAILABLE,  INCLUDING  VAR1MAX  AND 
DIRECT  QUART IM IN.  FACTOR  SCORES  ARE  COMPUTED  FOR 
EACH  CASE,  AND  MAHALANOBIS  DISTANCES  ARE  COMPUTED 
FROM  THE  CENTROID  OF  ALL  CASES  FOR  THE  FACTOR 
SCORES,  RAW  DATA,  AND  THE  RESIDUALS  OF  THE  ORIGINAL 
DATA  REGRESSED  ON  THE  FACTOR  SCORES.  OUTPUT  INCLUDES 
MEANS,  STANDARD  DEVIATIONS,  COEFFICIENTS  OF  VARIA- 
TION, MAX1MUMS  AND  M1NIMUMS,  STANDARD  SCORES,  CORRE- 
LATION MATRIX,  COVARIANCE  MATRIX,  FACTOR  LOADINGS 
AND  THEIR  PLOTS,  SQUARED  MULTIPLE  CORRELATIONS,  FAC- 
TOR CORRELATIONS,  FACTOR  STRUCTURE,  FACTOR  SCORES, 
FACTOR  SCORE  COEFFICIENTS,  FACTOR  SCORE  COVARIANCES 
AND  FACTOR  SCORE  PLOTS.  THE  CORRELATION  MATRIX  CAN 
BE  PRINTED  IN  SHADED  FORM  AFTER  REARRANGING  THE 
ORDER  OF  THE  VARIABLES  ACCORDING  TO  THE  FACTOR 
LOAD  1 NGS . 

CANONICAL  CORRELATION  ANALYSIS 

COMPUTES  CANONICAL  CORRELATION  ANALYSIS  FOR  TWO  SETS 
OF  VARIABLES.  INPUT  CAN  BE  A DATA  MATRIX,  COVARIANCE 
MATRIX  OR  A CORRELATION  MATRIX.  OUTPUT  INCLUDES 
MEANS,  STANDARD  DEVIATIONS,  COEFFICIENTS  OF  VARIA- 
TION, SKEWNESS,  KURTOSIS,  MINIMUMS,  MAXIMUMS,  CORRE- 
LATIONS, COVARIANCES,  CANONICAL  CORRELATIONS,  EIGEN- 
VALUES ASSOCIATED  WITH  EACH  PAIR  OF  CANONICAL  VARI- 
ABLES AND  bARTLETT*"  TEST  FOR  THE  SIGNIFICANCE  OF 
THE  REMAINING  EIGENVALUES,  COEFFICIENTS  FOR  CANONI- 
CAL VARIABLES,  The  SCORES  OF  THE  CANONICAL  variables 
FOR  EACH  CASE,  CORRELATIONS  OF  VARIABLES  WITH 
CANONICAL  VARIABLES  (LOADINGS)  AND  BIVARIATE  PLOTS 
FOR  ORIGINAL  VARIABLES  AND  CANONICAL  VARIABLES. 

STEPWISE  DISCRIMINANT  ANALYSIS 

PERFORMS  A MULTIPLE  GROUP  DISCRIMINANT  ANALYSIS.  THE 
VARIABLES  USED  IN  COMPUTING  THE  LINEAR  CLASSIFICA- 
TION FUNCTIONS  ARE  CHOSEN  IN  A STEPWISE  MANNER.  AT 
EACH  STEP  THE  VARIABLE  THAT  ADDS  MOST  TO  THE  SEPARA- 
TION OF  THE  GROUPS  IS  ENTERED.  BY  SPECIFYING  CON- 
TRASTS GROUP  DIFFERENCES  OF  INTEREST  MAY  BE  STATED. 
THESE  CONTRASTS  GUIDE  SELECTION  OF  THE  VARIABLES. 

FOR  EACH  CASE  THE  GROUP  CLASSIFICATIONS  ARE  EVAL- 
UATED. BASED  ON  THE  POSTERIOR  PROBABILITIES,  A CLAS- 
SIFICATION TABLE  IS  COMPUTED  (PRIOR  PROBABILITIES 
CAN  BE  SPECIFIED  FOR  USE  IN  THESE  COMPUTATIONS).  IN 
ADDITION  A J«CKKN 1 FE-V AL I D A T I ON  PROCEDURE  CAN  BE 
REQUESTED  TO  REDUCE  THE  BIAS  IN  THE  GROUP  CLASSIFI- 
CATIONS. THE  PROGRAM  COMPUTES  CANONICAL  DISCRIMINANT 
FUNCTIONS  AND  PLOTS  THE  FIRST  TWO  TO  GIVE  AN  OPTIMAL 
TWO-DIMENSIONAL  PICTURE  OF  THE  SEPARATION  OF  THE 
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6R0UPS.  OUTPUT  INCLUDES  MEANS,  STANDARD  DEVIATIONS, 

F STATISTICS,  AND  DEGREES  OF  FREEDOM  FOR  EACH  VARI- 
ABLE AT  EACH  STEP,  F STATISTICS  FOR  DISTANCES  BE- 
TWEEN PAIRS  OF  GROUPS,  WILKS’  LAMBDA  (U  STATISTIC) 

FOR  MULTIVARIATE  ANOVA,  MAHALANOBIS’  D-SQUARE  OF 
EACH  CASE  FROM  EACH  GROUP  MEAN,  CLASSIFICATION 
FUNCTIONS  AND  MATRICES,  AND  PERCENT  CORRECT  CLASSI- 
FICATION, COEFFICIENTS  FOR  CANONICAL  DISCRIMINANT 
FUNCTIONS,  CANONICAL  CORRELATIONS,  CANONICAL  VARIA- 
BLES ANO  A PLOT  OF  THE  FIRST  TWO  CANONICAL  VARIABLES. 

DESCRIPTION  AND  ESTIMATION  OF  MISSING  VALUES 
DESCRIBES  THE  PATTERN  OF  MISSING  VALUES  FOR  MULTI- 
VARIATE DATA  AND  PROVIDES  ESTIMATES  OF  THE  MISSING 
DATA.  ESTIMATES  CAN  BE  MEANS!  PREDICTED  VALUES  FOR 
EACH  MISSING  VARIABLE  DERIVED  FROM  REGRESSING  IT  ON 
THE  AVAILABLE  VARIABLE  WITH  WHICH  IT  IS  MOST  HIGHLY 
HIGHLY  CORRELATED!  PREDICTED  VALUES  FOR  EACH  MISS- 
ING VARIABLE  DERIVED  FROM  ITS  STEPWISE  REGRESSION 
ON  SOME  OF  THE  AVAILABLE  VARIABLES!  ANO  PREDICTED 
VALUES  FOR  EACH  MISSING  VARIABLE  DERIVED  FROM  RE- 
GRESSING IT  ON  ALL  AVAILABLE  VARIABLES.  THE  COVARI- 
ANCE MATRIX  CAN  BE  COMPUTED  USING  COMPLETE  CASES 
ONLY,  OR  USING  ALL  ACCEPTABLE  VALUES.  WHEN  THE 
LATTER  METHOD  IS  USED  OR  EXPLICITLY  REQUESTED,  THE 
EIGENVALUES  ARE  FOUND,  AND  THE  CORRELATION  MATRIX 
IS  REESTIMATED  USING  ONLY  POSITIVE  EIGENVALUES  AND 
THEIR  EIGENVECTORS.  A GROUPING  VARIABLE  AND  CASE 
WEIGHTS  CAN  BE  SPECIFIED.  THE  COVARIANCE  MATRIX  AND 
THE  DATA  WITH  ESTIMATES  REPLACING  THE  MISSING  VALUES 
CAN  BE  SAVED  IN  A B MDP  FILE.  THE  MATRIX  OF  PAIRWISE 
FREQUENCIES  OF  VARIABLES  CAN  ALSO  BE  SAVED  AND  ANA- 
LYZED IN  OTHER  BMDP  PROGRAMS  (SUCH  AS  FACTOR  ANALY- 
SIS PROGRAM,  BMDPRM)  TO  FURTHER  STUDY  THE  PATTERN  OF 
MISSING  VALUES.  PAM  CAN  PRINT  THE  DATA,  THE  PATTERN 
OF  MISSING  DATA,  SQUARED  MULTIPLE  CORRELATIONS  OF 
EACH  VARIABLE  WITH  ALL  OTHER  VARIABLES,  MISSING  DATA 
ESTIMATES  AND  MAHALANOBIS  DISTANCES  FOR  EACH  CASE  TO 
THE  CENTROID  OF  ALL  CASES,  BIVARIATE  PLOTS  FOR 
SPECIFIED  PAIRS  OF  VARIABLES,  THE  CORRELATION  MA- 
TRIX, EIGENVALUES  OF  THE  CORRELATION  MATRIX,  AND  THE 
COVARIANCE  MATRIX.  BIVARIATE  SCATTER  PLOTS  CAN  BE 
REQUESTED  FOR  ANY  PAIR  OF  VARIABLES.  IN  THESE  PLOTS 
ESTIMATED  VALUES  ARE  DISTINGUISHED  FROM  THE  ORIGINAL 
VALUES.  FOR  EACH  VARIABLE  IN  EACH  GROUP  PAM  PRINTS 
THE  SAMPLE  SIZE,  PERCENTAGE  MISSING,  MEAN,  STANDARD 
DEVIATION,  COEFFICIENT  OF  VARIATION,  MAXIMUM,  MINI- 
MUM, MAXIMUM  STANDARD  SCORE,  MINIMUM  STANDARD  SCORE, 
SKEWNESS,  AND  KURTOSIS. 
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BMDOIM  PRINCIPAL  COMPONENT  ANALYSIS 

COMPUTES  PRINCIPAL  COMPONENTS  OF  STANDARDIZED 
DATA,  AND  RANK  ORDERS  EACH  STANDARDIZED  CASE  BY 
THE  SIZE  OF  EACH  PRINCIPAL  COMPONENT  SEPARATELY. 
OUTPUT  INCLUDES  CORRELATION  C 0 E F F I C I E N T S , EIGEN- 
VALUES AND  RANK  ORDER  OF  EACH  STANDARDIZED  CASE. 
WILL  HANDLE  UP  TO  2S  ORIGINAL  VARIABLES  AND  400 
CASES. 

BMD02M  REGRESSION  ON  PRINCIPAL  COMPONENTS 

COMPUTES  PRINCIPAL  COMPONENTS  OF  STANDARDIZED  DATA 
AND  RANK  ORDERS  EACH  STANDARDIZED  CASE  BY  THE  SIZE 
OF  EACH  PRINCIPAL  COMPONENT  SEPARATELY.  EACH  DE- 
PENDENT VARIABLE  IS  REGRESSED  ON  THE  FIRST,  FIRST 
TWO,  FIRST  THREE,  AND  ALL  PRINCIPAL  COMPONENTS 
WHEN  EACH  COMPONENT  IS  EXPRESSED  IN  TERMS  OF  STAN- 
DARDIZED DATA. 

BMD03M  GENERAL  FACTOR  ANALYSIS 

PERFORMS  A PRINCIPAL  COMPONENT  SOLUTION  AND  OR- 
THOGONAL ROTATION  OF  THE  FACTOR  MATRIX.  DATA  INPUT 
MAY  BE  RAW  DATA,  A CORRELATION  MATRIX,  OR  A FACTOR 
MATRIX.  COMMONALITIES  ARE  ESTIMATED  FROM  THE 
SQUARED  MULTIPLE  CORRELATION  COEFFICIENTS  OR  THE 
MAXIMUM  ABSOLUTE  ROW  VALUES,  OR  THEY  MAY  BE  SPEC- 
IFIED BY  THE  USER  TO  REPLACE  THE  DIAGONAL  OF  THE 
CORRELATION  MATRIX.  WILL  HANDLE  BO  VARIABLES  MEA- 
SURED ON  9,999  SUBJECTS. 

BMUOMM  DISCRIMINANT  ANALYSIS  FOR  TWO  GROUPS 

COMPUTES  A LINEAR  FUNCTION  OF  P VARIABLES  MEASURED 
ON  EACH  INDIVIDUAL  OF  TWO  GROUPS',  THIS  CAN  SERVE 
AS  AN  INDEX  FOR  DISCRIMINATION  BETWEEN  THE  GROUPS. 
IT  IS  DETERMINED  FROM  THE  CRITERION  OF  "BEST"  IN 
THAT  THE  DIFFERENCE  BETWEEN  THE  MEAN  INDICES  FOR 
THE  TWO  GROUPS  DIVIDED  BY  A POOLED  STANDARD  DEVI- 
ATION OF  THE  INDICES  IS  MAXIMIZED  (ESSENTIALLY 
HOTELLING'S  T-SQUARE  TEST  ON  TWO  VECTORS  OF 
MEANS).  SEE  ALSO  BMDO  7M  • 

BMDOSM  DISCRIMINANT  ANALYSIS  FOR  SEVERAL  GROUPS 

DUPLICATES  THE  PROCEDURES  OF  BMD04M,  BUT  FOR  TWO  TO 

FIVE  GROUPS  WITH  UP  TO  2S  VARIABLES  AND  NO  GROUP 
EXCEEDING  I7S  IN  NUMBER.  INPUT  DATA  ARE  A SET  OF 
OBSERVATIONS  FOR  EACH  OF  THE  CLASSIFICATION  GROUPS 
AND  EACH  OBSERVATION  CONTAINS  A VALUE  FOR  EACH  OF 
THE  VARIABLES.  THE  COMPUTATIONAL  PROCEDURE  EVAL- 
UATES THE  COMPUTED  LINEAR  FUNCTION  CORRESPONDING 
TO  EACH  OF  THE  GROUPS  AND  ASSIGNS  AN  INDIVIDUAL  TO 
THE  GROUP  FOR  WHICH  THE  VALUE  IS  LARGEST.  SEE  ALSO 
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BMD06M 


BMD07M 


BMD  X 7 2 
(OR  BMDOBkI 


BMD  X 7 *4 
(OR  B M r>  1 0 M ) 


BMD07M. 

CANONICAL  CORRELATION  ANALYSIS 

COMPOTES  THE  CANONICAL  CORRELATIONS  BETWEEN  TWO 
SETS  OF  VARIABLES.  OUTPUT  INCLUDES  THE  SIMPLE  COR- 
RELATION MATRIX,  STANDARD  DEVIATIONS,  AND  CANONI- 
CAL COE  > C I ENTS  FOR  STANDARDIZED  VARIABLES.  WILL 
HANDLE  UP  TO  A TOTAL  OF  100  VARIABLES  WITH  THE 
SMALLER  OF  THE  TWO  SETS  OF  VARIABLES  CONTAINING  NO 
MORE  THAN  3S  VARIABLES. 

STEPWISE  DISCRIMINANT  ANALYSIS 

PERFORMS  A MULTIPLE  DISCRIMINANT  ANALYSIS  IN  A 
STEPWISE  MANNER  FOR  2 TO  80  GROUPS  WITH  UP  TO  80 
VARIABLES.  A SET  OF  LINEAR  CLASSIFICATION  FUNC- 
TIONS IS  COMPUTED  BY  CHOOSING  THE  INDEPENDENT 
VARIABLES  IN  A STEPWISE  MANNER.  USING  THESE  FUNC- 
TIONS AND  PRIOR  PROBABILITIES,  THE  POSTERIOR  PROB- 
ABILITIES OF  EACH  CASE  BELONGING  TO  EACH  GROUP 
ARE  COMPUTED.  ALSO  COMPUTES  COEFFICIENTS  FOR  CA- 
NONICAL VARIABLES  TO  GIVE  A TWO-DIMENSIONAL  PIC- 
TURE OF  GROUP  SEPARATIONS.  GROUPS  ARE  EVALUATED 
IN  PAIRS  FOR  SIGNIFICANCE  OF  DIFFERENCES  OF  MEAN 
BY  THE  F-RAT10  AND  HOTELL I NG  * 5 T-SQUARE. 

FACTOR  ANALYSIS 

PERFORMS  FACTOR  ANALYSIS  ON  A MAXIMUM  OF  1 9-fl  IN- 
PUT VARIABLES,  USING  EITHER  COVARIANCE  OR  CORRE- 
LATION MATRICES.  ;nITIAL  COMMUNALITY  ESTIMATES 
MAY  BE  SQUARED  MULTIPLE  CORRELATIONS,  REGRESSION 
VARIANCES,  MAXIMUM  ABSOLUTE  ROW  VALUES,  OR  THEY 
MAY  BE  SPECIFIED  BY  THE  USER.  IF  REQUESTED.  THE 
PROGRAM  WILL  ITERATE  ON  THE  INITIAL  COMMUNALITY 
ESTIMATES.  THREE  TYPES  OF  ROTATION  ARE  AVAILABLE, 
ALL  BASED  ON  THE  OBLlMIN  CRITERIA.  FACTOR  SCORES 
MAY  BE  ESTIMATED.  DATA  INPUT  MAY  BE  IN  THE  FORM 
OF  RAW  DATA,  A CORRELATION  MATRIX,  A COVARIANCE 
MATRIX,  OR  A FACTOR  LOADING  MATRIX.  THIS  PROGRAM 
SUPERSEDES  BMD03M. 

IDENTIFICATION  OF  OUTLIERS 

SCREENS  MULTIVARIATE  DATA  FOR  OUTLIERS  BY  COM- 
PUTING THE  MAHALANOBIS  DISTANCE  OF  EACH  CASE  FROM 
THE  CENTER  OF  THE  DISTRIBUTION  OF  THE  REMAINING 
CASES.  IF  THE  PROBABILITY  OF  THE  F-STATISTIC 
CORRESPONDING  TO  THE  GREATEST  DISTANCE  IS  SMALLER 
THAN  A SPECIFIED  VALUE,  THE  CASE  INVOLVED  IS  RE- 
MOVED. 
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BMDP  1 R 
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- REGRESSION  analysis 


MULTIPLE  LINEAR  REGRESSION 

COMPUTES  A MULTIPLE  LINEAR  REGRESSION  EQUATION  ON 
ALL  DATA  AND  ON  GROUPS  OR  SUBSETS  OF  THE  DATA!  EQUA- 
TIONS WITH  OR  WITHOUT  AN  INTERCEPT  CAN  BE  CHOSEN. 

IF  A GROUPING  VARIABLE  IS  SPECIFIED  TO  FORM  GROUPS, 
HOMOGENEITY  OF  REGRESSION  COEFFICIENTS  ACROSS  GROUPS 
IS  TESTED.  IT  IS  ALSO  POSSIBLE  TO  SPECIFY  CASE 
WEIGHTS.  FOR  ALL  DATA  AND  ALL  REQUESTED  GROUPS,  THE 
OUTPUT  INCLUDES  MEAN,  STANDARD  DEVIATION,  MINIMUMS 
MAXIMUMS,  MULTIPLE  R,  AND  STANDARD  ERROR  OF  ESTI- 
MATES FOR  EACH  VARIABLE*.  AN  ANALYSIS  OF  VARIANCE 
TABLE  CONSISTING  OF  REGRESSION  AND  RESIDUAL  SUM  OF 
SQUARES,  DEGREES  OF  FREEDOM  AND  MEAN  SQUARES  I F STA- 
TISTIC AND  PROBABILITY  FOR  THE  regression  equation; 
AND  THE  REGRESSION  COEFFICIENTS,  THEIR  STANDARD 
ERRORS,  AND  T STATISTICS  AND  PROBABILITIES.  A USER 
CAN  REQUEST  THE  COVARIANCE  OR  CORRELATION  MATRIX  I 
SCATTER  PLOTS,  NORMAL  AND  DETRENDED  NORMAL  PROBA- 
BILITY PLOTS  OF  RESIDUALS,  AND  PARTIAL  RESIDUAL 
PLOTS;  RESIDUALS,  PREDICTED  VALUES  and  DATA  FOR  EACH 
CASE. 

STEPWISE  REGRESSION 

ESTIMATES  THE  PARAMETERS  OF  MULTIPLE  LINEAR  REGRES- 
SION EQUATIONS  IN  A STEPWISE  MANNER.  FOUR  STEPPING 
ALGORITHMS  ARE  AVAILABLE.  VARIABLES  CAN  BE  FORCED 
INTO  THE  REGRESSION  EQUATION;  NONFORCED  VARIABLES 
CAN  BE  DIRECTED  IN  THEIR  ORDER  OF  ENTRY  BY  ASSIGNING 
THEM  TO  DIFFERENT  LEVELS.  REGRESSION  EQUATIONS  WITH 
OR  WITHOUT  AN  INTERCEPT  CAN  BE  CHOSEN,  OR  THE  INTER- 
CEPT CAN  BE  TREATED  AS  THOUGH  IT  WERE  AN  INDEPENDENT 
VARIABLE,  IN  WHICH  CASE  IT  ENTERS  THE  REGRESSION 
EQUATION  ONLY  IF  IT  IS  SIGNIFICANT.  BOTH  FORWARD  AND 
BACKWARD  STEPPING  ARE  POSSIBLE.  OUTPUT  INCLUDES  THE 
MEAN  ANO  STANDARD  DEVIATION  OF  EACH  VARIABLE.  THE 
COVARIANCE  OR  CORRELATION  MATRIX  CAN  BE  PRINTED. 

FOR  EACH  STEP  AN  ANALYSIS  OF  VARIANCE  TABLE  AND 
AND  MULTIPLE  CORRELATION  IS  PRINTED,  AND  IF  RE- 
QUESTED, COEFFICIENTS,  STANDARDIZED  COEFFICIENTS, 
STANDARD  ERRORS,  F RATIOS,  PARTIAL  CORRELATIONS 
AND  TOLERANCE.  OTHER  OPTIONAL  OUTPUT  INCLUDES 
SUMMARY  TABLES  FOR  F RATIOS,  PARTIAL  CORRELATIONS 
AND  COEFFICIENTS;  PREDICTED  VALUES,  RESIDUALS  AND 
DATA  FOR  EACH  CASE;  SCATTER  PLOTS  OF  PREDICTED  AND 
OBSERVED  VALUES  VERSUS  SELECTED  VARIABLE  VALUES, 
RESIDUALS  VERSUS  SELECTED  VARIABLES,  AND  NORMAL 
probability  plots  of  residuals. 
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NONLINEAR  REGRESSION 

OBTAINS  A LEAST  SQUARES  FIT  TO  A NONLINEAR  FUNCTION. 
IT  IS  APPROPRIATE  FOR  A WIDE  VARIETY  OF  PROBLEMS 
THAT  ARE  NOT  WELL-REPRESENTED  BY  EQUATIONS  THAT  ARE 
LINEAR  IN  THE  PARAMETERS.  FIVE  OF  THE  MOST  FREQUENT- 
LY USED  NONLINEAR  FUNCTIONS  ARE  AVAILABLE  BY  SIMPLY 
STATING  THEIR  CODE  NUMBER.  OTHER  FUNCTIONS  CAN  BE 
USED  BY  STATING  THE  FUNCTION  AND  ITS  DERIVATIVES  IN 
FORTRAN  STATEMENTS.  THE  FIT  IS  OBTAINED  BY  MEANS  OF 
GAUSS-NEWTON  ITERATIONS.  LINEAR  EQUALITY  CONSTRAINTS 
ON  THE  PARAMETERS  CAN  BE  STATED,  AND  CASE  WEIGHTS 
CAN  BE  USED.  PARAMETERS  CAN  ALSO  BE  ESTIMATED  BY 
MAXIMUM  LIKELIHOOD.  AFTER  EACH  ITERATION,  THE  PARA- 
METER VALUES  AND  RESIDUAL  SUM  OF  SQUARES  ARE  PRINTED. 
AFTER  THE  LAST  ITERATION,  THE  PROGRAM  REPORTS  THE 
ASYMPTOTIC  CORRELATIONS  AND  STANDARD  DEVIATIONS  FOR 
THE  ESTIMATED  PARAMETERS.  FOR  EACH  CASE  AFTER  THE 
LAST  ITERATION,  P 3 R LISTS  THE  PREDICTED  AND  OBSERVED 
VALUES  FOR  THE  DEPENDENT  VARIABLE,  RESIDUAL  VALUE, 

AND  OBSERVED  VALUES  FOR  THE  INDEPENDENT  VARIABLES. 
SCATTER  PLOTS  OF  PREDICTED  AND  OBSERVED  VALUES  VERSUS 
SELECTED  VARIABLE  VALUES  MAY  BE  REQUESTED,  AS  WELL 
AS  RESIDUALS  VERSUS  SELECTED  VARIABLES,  AND  NORMAL 

AND  DETRENDED  NORMAL  PROBABILITY  PLOTS  OF  RESIDUALS. 

REGRESSION  ON  PRINCIPAL  COMPONENTS 

COMPUTES  A REGRESSION  ANALYSIS  FOR  EACH  DEPENDENT 
VARIABLE  ON  A SET  OF  PRINCIPAL  COMPONENTS  COMPUTED 
FROM  THE  INDEPENDENT  VARIABLES.  THE  PRINCIPAL 
COMPONENTS  ARE  COMPUTED  FROM  THE  ORIGINAL  VARIABLES 
(USING  THE  COVARIANCE  MATRIX)  OR  THE  STANDARDIZED 
VARIABLES  (USING  THE  CORRELATION  MATRIX).  THE  RE- 
GRESSION ANALYSIS  IS  PERFORMED  IN  A STEPWISE  MANNER 
AND  THE  RESULTING  COEFFICIENTS  ARE  REPORTED  IN 
TERMS  OF  BOTH  PRINCIPAL  COMPONENTS  AND  THE  ORIGI- 
NAL OR  STANDARDIZED  VARIABLES.  THE  ORDER  OF  ENTRY 
OF  COMPONENTS  CAN  BE  BASED  ON  THE  MAGNITUDE  OF 
EIGENVALUES  OR  ON  THE  ABSOLUTE  MAGNITUDE  OF  CORRE- 
LATION BETWEEN  THE  COMPONENT  AND  THE  DEPENDENT 
VARIABLE.  OUTPUT  INCLUDES  MEANS,  STANDARD  DEVIA- 
TIONS, THE  COVARIANCE  MATRIX  (THE  CORRELATION  MA- 
TRIX IF  STANDARDIZATION  IS  REQUESTED!,  EIGENVALUES 
AND  EIGENVECTORS  (PRINCIPAL  COMPONENTS),  REGRES- 
SION COEFFICIENTS  FOR  PRINCIPAL  COMPONENTS  AND  FOR 
INDEPENDENT  VARIABLES,  RESIDUAL  SUM  OF  SQUARES  AND 
F RATIOS  FOR  EACH  STEP  IN  THE  REGRESSION,  AND  F 
RATIOS  FOR  EACH  COMPONENT.  PRINCIPAL  COMPONENT 

scores  may  be  requested,  as  well  as  scatter  plots 
OF  RAW  DATA,  and  normal  AND  DETRENDED  NORMAL  PROB- 
ABILITY PLOTS  OF  RESIOUALS. 
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POLYNOMIAL  REGRESSION 

A POLYNOMIAL  IN  ONE  VARIABLE  IS  FITTED  TO  THE  DE- 
PENDENT VARIABLE.  COMPUTATION  IS  DONE  USING  ORTHOG- 
ONAL POLYNOMIALS.  THE  PROGRAM  REPORTS  POLYNOMIALS 
OF  DEGREE  ONE  THROUGH  A DEGREE  SPECIFIED  BY  THE 
USER  (LESS  THAN  OR  EQUAL  TO  IS).  ESTIMATES  OF  THE 
REGRESSION  COEFFICIENTS  AND  THEIR  STANDARD  ERRORS 
ARE  REPORTED.  A SUMMARY  TABLE  OF  G 0 0 D N E S S - 0 F - F I T 
STATISTICS  FOR  EACH  (EXCEPT  THE  HIGHEST  DEGREE  ) 
POLYNOMIAL  IS  PRINTED.  FOR  EACH  DEGREE  A SCATTER 
PLOT  OF  THE  PREDICTED  AND  OBSERVED  VALUES  VERSUS 
THE  INDEPENDENT  VARIABLE,  AND  RESIDUAL  VERSUS  THE 
INDEPENDENT  VARIABLE  MAY  BE  REQUESTED.  THE  USER 
MAY  ALSO  REQUEST  NORMAL  AND  DETRENDED  NORMAL  PROB- 
ABILITY PLOTS  OF  THE  RESIDUALS,  THE  RESIDUALS 
LISTED  WITH  THE  DATA  AND  PREDICTED  VALUES,  AND  A 
CORRELATION  MATRIX  FOR  THE  REGRESSION  COEFFICIENT 
ESTIMATES. 

PARTIAL  CORRELATION  AND  MULTIVARIATE  REGRESSION 
COMPUTES  THE  PARTIAL  CORRELATIONS  OF  A SET  OF  VARI- 
ABLES REMOVING  THE  LINEAR  EFFECTS  OF  A SECOND  SET 
OF  VARIABLES.  THE  PROGRAM  CAN  ALSO  BE  USED  FOR 
REGRESSION,  ESPECIALLY  WHEN  THERE  ARE  MULTIPLE  DE- 
PENDENT VARIABLES  (SINCE  THE  COMPUTATIONS  FOR 
PARTIAL  CORRELATIONS  INCLUDE  THE  COMPUTATIONS  OF 
THE  REGRESSION  COEFFICIENTS  FOR  PREDICTING  THE 
FIRST  SET  OF  VARIABLES  FROM  THE  SECOND  SET  OF 
VARIABLES).  USING  DOUBLE  PRECISION,  P6R  CAN  BE 
USED  TO  CHECK  COMPUTATIONS  FROM  PlR  AND  P 2 R FOR 
PROBLEMS  THAT  MAY  BE  ILL-CONDITIONED  OR  INVOLVE  A 
LARGE  NUMBER  (THOUSANDS)  OF  CASES.  INPUT  CAN  BE  A 
DATA  MATRIX,  A COVARIANCE  MATRIX  OR  A CORRELATION 
MATRIX.  OUTPUT  INCLUDES  MEANS,  STANDARD  DEVIATIONS, 
COEFFICIENTS  OF  VARIATION,  SKEWNESS,  KURTOSIS, 
MINIMUMS,  MAXIMUMS,  CORRELATIONS,  COVARIANCES, 
PARTIAL  CORRELATIONS,  PARTIAL  COVARIANCES,  REGRES- 
SION COEFFICIENTS,  STANDARDIZED  COEFFICIENTS, 
STANDARD  ERRORS  FOR  COEFFICIENTS,  COVARIANCES  AND 
CORRELATIONS  FOR  REGRESSION  COEFFICIENTS,  T TESTS 
AND  SIGNIFICANCE  LEVELS  FOR  REGRESSION  COEFFI- 
CIENTS, TWO  TYPES  OF  SQUARED  MULTIPLE  CORRELATIONS 
AND  THEIR  SIGNIFICANCE  LEVELS  (THE  FIRST,  OF  EACH 
INDEPENDENT  VARIABLE  WITH  THE  OTHER  INDEPENDENT 
VARIABLES!  the  SECOND,  OF  EACH  dependent  VARIABLE 
WITH  ALL  OF  THE  INDEPENDENT  VARIABLES),  RESIDUALS 
FROM  PARTIAL  CORRELATION  ANALYSIS  AND  BIVARIATE 
PLOTS  FOR  ORIGINAL  VARIABLES  AND  RESIDUALS. 
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ALL  POSSIBLE  SUBSETS  REGRESSION 

IDENTIFIES  "BEST"  SUBSETS  OF  PREDICTOR  VARIABLES. 
BEST  IS  DEFINED  IN  TERMS  OF  THE  SAMPLE  R-SQUARE, 
ADJUSTED  R-SQUARE,  OR  MALLOWS’  C(P>.  FOR  EXAMPLE, 

IF  ADJUSTED  R-SQUARE  IS  CHOSEN,  THE  BEST  SUBSET  IS 
THE  SUBSET  THAT  MAXIMIZES  ADJUSTED  R-SQUARE.  NOTE 
THAT  BEST  IS  DEFINED  AS  NUMERICALLY  BEST  IN  TERMS 
OF  THE  OBSERVED  SAMPLE  AND  THAT  IAS  FOR  STEPWISE 
REGRESSION)  NO  IMPLICATION  IS  MADE  THAT  ANY  SUBSET 
CAN  BE  CONSIDERED  BEST  FOR  THE  POPULATION  FROM 
WHICH  THE  SAMPLE  WAS  TAKEN.  HOWEVER,  IAS  FOR 
STEPWISE  REGRESSION)  A WELL-CHOSEN  SUBSET  FOR  ONE 
SAMPLE  IS  LIKELY  TO  BE  GOOD  FOR  SIMILAR  SAMPLES. 

THE  NUMBER  OF  BEST  SUBSETS  CAN  BE  SPECIFIED!  UP 
TO  TEN  CAN  BE  REQUESTED.  THUS,  NOT  ONLY  THE  BEST 
BUT  ALSO  THE  SECOND  BEST,  THIRD  BEST,  ETC.  SUBSETS 
ARE  IDENTIFIED  TO  PROVIDE  SEVERAL  GOOD  ALTERNA- 
TIVES. WHEN  M BEST  SUBSETS  ARE  REQUESTED,  THE  R- 
SQUARE  CRITERION  REQUESTS  THAT  THE  M BEST  SUBSETS 
OF  EACH  SUBSET  SIZE  BE  DETERMINED.  WHEN  THE  AD- 
JUSTED R-SQUARE  OR  MALLOWS’  CIP)  CRITERION  IS 
CHOSEN,  THE  M BEST  SUBSETS  ARE  FOUND  REGARDLESS  OF 
SUBSET  SIZE.  THE  ALGORITHM  USED  IDENTIFIES  THE 
BEST  SUBSETS  WHILE  COMPUTING  ONLY  A SMALL  FRACTION 
OF  ALL  POSSIBLE  REGRESSIONS  AND  IS  SUBSTANTIALLY 
MORE  EFFICIENT  THAN  THOSE  PREVIOUSLY  AVAILABLE 
INPUT  CAN  BE  DATA  OR  A CORRELATION  OR  COVARIANCE 
MATRIX.  FOR  UP  TO  APPROXIMATELY  27  VARIABLES,  COM- 
PUTER COSTS  ARE  COMPARABLE  TO  COSTS  FOR  STEPWISE 
REGRESSION.  OUTPUT  INCLUDES  MEANS,  STANDARD  DEVI- 
ATIONS, COEFFICIENTS  OF  VARIATION,  MIN1MUMS,  MAXI- 
MUMS,  MINIMUM  AND  MAXIMUM  STANDARD  SCORES,  SKEWNESS 
AND  KURTOSIS,  CORRELATION  MATRIX,  COVARIANCE  MATRIX, 
ANO  BIVARIATE  PLOTS  FOR  ANY  PAIR  OF  VARIABLES. 

FOR  EACH  OF  THE  M BEST  SUBSETS,  THE  SELECTION  CRI- 
TERION, REGRESSION  COEFFICIENTS  AND  F RATIO  ARE 
PRINTED.  FOR  THE  BEST  SUBSET  P9R  PRINTS  R-SQUARE, 

F RATIO  AND  SIGNIFICANCE!  REGRESSION  COEFFICIENTS, 
STANDARDIZED  REGRESSION  COEFFICIENTS,  STANDARD 
ERRORS,  T TESTS,  SIGNIFICANCE  AND  TOLERANCE! 
PREDICTED  VALUES,  RESIDUALS,  WEIGHTED  RESIDUALS, 
STANDARDIZED  RESIDUALS,  DELETED  RESIDUALS,  AND 
COOK’S  MEASURE  OF  THE  INFLUENCE  OF  EACH  CASE  ON  THE 
REGRESSION  EQUATION,  BIVARIATE  SCATTER  PLOTS  CAN 
BE  REQUESTED  FOR  ANY  PAIR  OF  VARIABLES  INCLUDING 
RESIDUALS,  STANDARDIZED  RESIDUALS,  ETC.  A NORMAL 
PROBABILITY  PLOT  FOR  THE  STANDARDIZED  RESIDUALS 
CAN  BE  PRINTED. 


64 


mathematical 

BMDO  1 R 

BMD02R 


BMD03K 

BMDOMR 


BMD05R 


L 


AND  STATISTICAL  SOFTWARE  INDEX 


SIMPLE  LINEAR  REGRESSION 
(ONE-WAY  ANALYSIS  OF  COVARIANCE! 

SIMPLE  LINEAR  REGRESSION  ANALYSIS  ON  SINGLE  OR 
COMBINED  CATEGORIES  WITH  UNEQUAL  SAMPLE  SIZES. 
ANALYSIS  OF  COVARIANCE  INFORMATION  IS  ALSO  OUT- 
PUT. WILL  HANDLE  UP  TO  999  GROUPS  WITH  UP  TO  979 
SUBJECTS. 

STEPWISE  REGRESSION 

COMPUTES  A SEQUENCE  Of  MULTIPLE  LINEAR  REGRESSION 
EQUATIONS  IN  A STEPWISE  MANNER.  AT  EACH  STEP  ONE 
VARIABLE  IS  ADDED  TO  THE  REGRESSION  EQUATION.  THE 
VARIABLE  ADDED  IS  THE  ONE  WHICH  MAKES  THE  GREATEST 
REDUCTION  IN  THE  ERROR  SUM  OF  SQUARES.  ALSO,  VARI- 
ABLES CAN  BE  FORCED  INTO  THE  REGRESSION  EQUATION. 
NON-FORCED  VARIABLES  ARE  AUTOMATICALLY  REMOVED 
WHEN  THEIR  F VALUES  BECOME  TOO  LOW.  PLOTS  OF  RE- 
SIDUALS ARE  AVAILABLE.  WILL  HANDLE  80  VARIABLES 
AND  9,999  SUBJECTS. 

MULTIPLE  REGRESSION  WITH  CASE  COMBINATIONS 
PERFORMS  MULTIPLE  REGRESSION  AND  CORRELATION  ANAL- 
YSES ON  THE  DATA  WITHIN  EACH  SELECTION  OF  SUBSAM- 
PLES FROM  THE  SAME  POPULATION.  SELECTIONS  NEED  NOT 
NOT  BE  MUTUALLY  EXCLUSIVE.  WILL  HANDLE  UP  TO  28 
SUBSAMPLES,  50  VARIABLES,  AND  TOTAL  SAMPLE  SIZE  OF 
99,999  FOR  ALL  SUBSAMPLES  COMBINED. 

PERIODIC  REGRESSION  AND  HARMONIC  ANALYSIS 
PERFORMS  PERIODIC  OR  HARMONIC  REGRESSION  ANALYSIS 
USING  THE  MODEL, 

Y ( T ) = A ( 0 ) ♦ THE  SUM  (FROM  I « 1 TO  N ) OF 

C A ( I ) *COS < 2*P  I • I *T/K  I ♦ 

B ( I ) *S I N ( 2*P I • I *T/K ) D , 

WHERE  N IS  LESS  THAN  OR  EQUAL  TO  9.  PERIODIC  RE- 
GRESSIONS ARE  COMPUTED  FOR  SUCCESSIVE  VALUES  OF  N, 
INCLUDING  ANALYSIS  OF  VARIANCE  AND/OR  COVARIANCE 
TABLE  FOR  EACH  REGRESSION. 

POLYNOMIAL  REGRESSION 

COMPUTES  POLYNOMIAL  REGRESSIONS  OF  THE  FORM 

Y = BIO)  ♦ B ( 1 ) • X ♦ B ( 2 ) • X • • 2 ♦ ... 

♦ B ( K ) • X • • K ♦ E, 

WHERE  K IS  A POSITIVE  INTEGER  LESS  THAN  OR  EQUAL  TO 
10,  AND  THE  NUMBER  OF  CASES  IS  LESS  THAN  OR  EQUAL 
TO  500. 
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BMD06R  ASYMPTOTIC  REGRESSION 

PERFORMS  ASYMPTOTIC  REGRESSION  ANALYSES  USING  THE 
MODIFIED  EXPONENTIAL  FUNCTION  OF  THE  FORM 

Y * A ♦ 0 • R • * X • 

TRANSGENERATION  OF  Y CAN  BE  USED. 

BMDX8S  NONLINEAR  LEAST  SQUARES 

(OR  B M D 0 7 R ) THIS  PROGRAM  OBTAINS  A L E A S T - S Q U A R E S FIT 

YIK)  « FC X ( 1 ,K  X I T ,K  IS S ( I S ( P ) D ♦ EIKI 

OF  A USER-SPECIFIED  FUNCTION  F TO  DATA  VALUES 
XII, K)  AND  Y ( K ) BY  MEANS  OF  STEPWISE  GAUSS-NEWTON 

ITERATIONS  ON  THE  PARAMETERS  5(1) SIP).  IF 

NECESSARY,  ONLY  A SUBSET  OF  THE  PARAMETERS  WILL  BE 
MODIFIED  IN  A GIVEN  ITERATION  IN  ORDER  TO  AVOID 
THE  PROP  LEMS  OF  SINGULARITY. 


CLASS  S - SPECIAL  PROGRAMS 


BMDP1S  MULTIPASS  TRANSFORMATION 

PIS  CAN  BE  USED  WHEN  INFORMATION  FROM  THE  DATA 
) ILE  IS  NEEDED  TO  COMPUTE  TRANSFORMATIONS  ( I • E • • 

ALL  THE  DATA  MU5T  BE  SCANNED  OR  PASSED  BEFORE 
TRANSFORMA  I ONS  CAN  BE  MADE!  FOR  EXAMPLE,  MISSING 
VALUES  MAY  NEED  TO  BE  REPLACED  BY  THE  MEAN  OF  THE 
OF  THE  VALUES  PRESENT  IN  THE  SAMPLE).  IN  EACH  PASS 
A FURTHER  TRANSFORMATION  OF  THE  DATA  CAN  BE  SPECI- 
FIED THROUGH  FORTRAN  STATEMENTS.  ARITHMETIC  MEANS, 
STANDARD  DEVIATIONS,  GEOMETRIC  MEANS.  HARMONIC 
MEANS,  AND  LARGEST  AND  SMALLEST  VALUES  FOR  EACH 
VARIABLE  CAN  BE  COMPUTED  IN  EACH  TRANSFORMATION 
PASS  TO  BE  USED  IN  THE  NEXT  PASS.  (ALL  THE  BMDP 
PROGRAMS  HAVE  THE  CAPABILITY  OF  DOING  SIMPLE 
TRANSFORMATIONS  AND  OATA  EDITING  PROCEDURES.) 

OUTPUT  FOR  EACH  PAS5  INCLUDES  THE  TRANSFORMED  DATA 
AND  ANY  OF  THE  STATISTICS  COMPUTED  FOR  EACH  SE- 
LECTED VARIABLE.  THE  TRANSFORMED  DATA  CAN  BE  SAVED 
ON  A BMDP  FILE. 

BMDP3S  NONPAR AMETR I C STATISTICS 

COMPUTES  ONE  OR  MORE  OF  THE  FOLLOWING  NONP A R A ME T R l C 
STATISTICS  FROM  A GIVEN  SET  OF  DATA:  SIGN  TEST, 

W I LCOXON-S I GNED  RANK  TEST,  KENDALL  RANK  CORRELATION 
COEFFICIENT,  SPEARMAN  RANK  CORRELATION  COEFFICIENT, 
FRIEDMAN  TWO-WAY  ANALYSIS  OF  VARIANCE,  KENDALL 
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COEFFICIENT  OF  CONCORDANCE.  MANN-WHITNEY  U RANK 
SUM  TEST,  AND  K R U S K A L - W A L L 1 S ONE-WAY  ANALYSIS  OF 
VAR  I ANCE  . 


BMDOIS  LIFE  TABLE  AND  SURVIVAL  RATE 

COMPUTES  PROPORTIONS  SURVIVING,  SURVIVAL  RATES,  AND 
STANDARD  ERRORS  FOR  SUCCESSIVELY  REDUCED  TIME 
PERIODS  AND  PRINTS  A PLOT  OF  THE  CUMULATIVE  PRO- 
PORTION SURVIVING. 


BMD02S  CONTINGENCY  TABLE  ANALYSIS 

COMPUTES  TWO-WAY  FREQUENCY  AND  PERCENTAGE  TABLES, 
CHI-SQUARES,  CONTINGENCY  COEFFICIENTS,  AND  MAXIMUM 
LIKELIHOOD  RATIOS.  EACH  VARIABLE  MAY  BE  CATEGORIZED 
IN  SEVERAL  DIFFERENT  WAYS.  OUTPUT  INCLUDES  FREQUEN- 
CY TABLES  WITH  PERCENTAGES  COMPUTED  FOR  EACH  ROW 
AND  COLUMN.  PERCENTAGES  ARE  ALSO  COMPUTED  FOR  THE 
ENTIRE  TABLE. 

BMD03S  BIOLOGICAL  ASSAY!  PROBIT  ANALYSIS 

OBTAINS  MAXIMUM  LIKELIHOOD  ESTIMATES  FOR  THE  PA- 
RAMETERS A AND  B IN  THE  PROBIT  EQUATION 

Y = A ♦ B • X « 

AS  AN  OPTION,  AN  ESTIMATE  OF  THE  NATURAL  (THRESH- 
OLD) RESPONSE  RATE  IS  ALSO  OBTAINED. 

BMD09S  TRANSGENERATION 

PERFORMS  SELECTED  T R A N S G E N E R A T I 0 N S ON  SPECIFIED 
VARIABLES  IN  THE  DATA.  ANY  OF  THE  STANDARD  BMD 
TRANSGENERATION  CODES  MAY  BE  SELECTED.  SEE  ALSO 
BMDX77  (OR  BMD12SI. 

BMDX  7 7 OPEN-ENDED  T R A N S GE NE R A T I 0 N 

(OR  BMD12S)  PERFORMS  T R A N S G E N E R A T I 0 N BY  MEANS  OF  U S E R - SUP P L I E D 
FORTRAN  STATEMENTS,  FOR  STANDARD  USE  ONLY  THE 
ARITHMETIC  STATEMENTS  ARE  REQUIRED,  ALTHOUGH  THE 
ENTIRE  FORTRAN  IV  LANGUAGE  IS  AVAILABLE.  SEE  ALSO 
BMD09S. 


class  t - time  series  analysis 


BMDOIT  AMPLITUDE  AND  PHASE  ANALYSIS 

COMPUTES  AMPLITUDE  AND  PHASE  OF  MODERATELY  WIDE- 
BAND NOISE  AND  NOISE  CONTAMINATED  BY  EXTRANEOUS 
NOISE,  USING  A PAIR  OF  FINITE  MOVING  AVERAGES  ON 
THE  SAMPLE  NOISE.  A GENERALIZED  TUK  E Y FILTER  WITH 
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A VARIABLE  NUMBER  OF  TRIANGLES  AND  RESOLUTIONS  IS 
USED. 

BM002T  AUTOCOVARIANCE  AND  POWER  SPECTRAL  ANALYSIS 

COMPUTES  THE  AUTOCOVARIANCE,  POWER  SPECTRUM, 
CROSS-COVARIANCE,  CROSS-SPECTRUM,  TRANSFER-FUNC- 
TION, AND  COHERENCE-FUNCTION  OF  TIME  SERIES. 

PLOTS  OF  EACH  ARE  INCLUDED. 


BMDX68  MULTIPLE  TIME  SERIES  SPECTRAL  ANALYSIS 

(OR  BMDOVTI  BEGINNING  WITH  A SEQUENCE  OF  CROSS-SPECTRAL  MA- 
TRICES, NORMALLY  GENERATED  BY  BM0X92.  THIS  PRO- 
GRAM ESTIMATES  MULTIPLE  COHERENCE  FUNCTIONS 
AND  FREQUENCY-RESPONSE  FUNCTIONS  BETWEEN  A SET  OF 
INPUT  AND  A SET  OF  OUTPUT  SERIES.  CONFIDENCE 
BANDS  ARE  OBTAINED  FOR  THE  GAIN  AND  PHASE  OF  THE 
COMPONENTS  OF  THE  FREQUENCY  RESPONSE  FUNCTIONS. 
ESTIMATES  OF  THE  C R 0 S S -S P E C T R A OF  THE  OUTPUT 
SERIES  PART  1 ALED  ON  THE  INPUT  SERIES  ARE  ALSO 
OBTAINED. 


BMDX92  TIME  SERIES  SPECTRUM  ESTIMATION 

(OR  BMD03TI  ESTIMATES  AUTO-SPECTRA,  CROSS-SPECTRA,  AND  CO- 
HERENCES FOR  STATIONARY  TIME  SERIES.  EACH  SERIES 
IS  DECOMPOSED  INTO  ITS  COMPONENT  FREQUENCIES  BY 
EVALUATING  ITS  FINITE  FOURIER  TRANSFORM. 


CLASS  V - VARIANCE  ANALYSIS 


BMDP1V  ONE-WAY  ANALYSIS  OF  VARIANCE  AND  COVARIANCE 

PERFORMS  A ONE-WAY  ANALYSIS  OF  VARIANCE,  OR  A ONE- 
WAY ANALYSIS  OF  COVARIANCE  IF  COVARIATES  ARE 
SPECIFIED.  GROUP  SIZES  CAN  BE  UNEQUAL  AND 
PARALLEL  ANALYSES  CAN  BE  PERFORMED  USING  SEVERAL 
DEPENDENT  VARIABLES.  THE  PROGRAM  TESTS  FOR 
EQUALITY  OF  SLOPES  FROM  GROUP  TO  GROUP  AND  COM- 
PUTES T STATISTICS  FOR  CONTRASTS  OF  ADJUSTED  GROUP 
MEANS  INCLUDING,  IN  PARTICULAR,  ALL  PAIRWISE 
DIFFERENCES.  THE  ANALYSIS  OF  COVARIANCE  IS 
ACCOMPANIED  BY  FOUR  TYPES  OF  GRAPHS  FOR  EACH 
GROUP:  SCATTER  PLOTS  OF  EACH  COVARIATE  VERSUS  THE 

OEPENOENT  VARIABLE  AND  ITS  PREDICTED  VALUES,  EACH 
COVARIATE  VERSUS  THE  RESIDUALS,  THE  RESIDUAL 
VERSUS  THE  PREDICTED  VALUES,  AND  THE  SQUARED  RESI- 
DUALS VERSUS  THE  PREDICTED  VALUES.  OUTPUT  FROM 
THE  ANALYSIS  OF  VARIANCE  ON  EACH  VARIABLE  INCLUDES 
THE  GROUP  MEANS,  AN  ANALYSIS  OF  VARIANCE  TABLE, 
PAIRWISE  T TESTS  FOR  GROUP  MEANS  AND  T TESTS  FOR 
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CONTRASTS  OF  GROUP  MEANS.  FOR  ANALYSIS  OF  COVAR- 
IANCE ON  EACH  DEPENDENT  VARIABLE  PlV  REPORTS 
REGRESSION  COEFFICIENTS,  THEIR  STANDARD  ERRORS  AND 
T VALUES,  6R0UP  MEANS,  ADJUSTED  GROUP  MEANS,  STAN- 
DARD ERRORS  OF  ADJUSTED  GROUP  MEANS,  AN  ANAL YS I S 
OF  VARIANCE  TABLE  WITH  F TEST  FOR  EQUALITY  OF 
ADJUSTED  GROUP  MEANS,  ZERO  SLOPE,  EQUALITY  OF 
SLOPES,  PAIRWISE  T TEST  FOR  ADJUSTED  GROUP  MEANS, 

T TESTS  FOR  CONTRASTS  OF  ADJUSTED  GROUP  MEANS,  AND 
REGRESSION  COEFFICIENTS  FOR  EACH  COvARIATE  IN  EACH 
GROUP.  OTHER  OUTPUT  AVAILABLE  INCLUDES  THE  COR- 
RELATION MATRIX  FOR  THE  REGRESSION  COEFFICIENTS 
AND  THE  ADJUSTED  GROUP  MEANS,  MAXIMUM,  MINIMUM  AND 
MEAN  OF  EACH  VARIABLE  IN  EACH  GROUP,  VARIANCE- 
COVARIANCE  MATRIX,  CORRELATION  MATRIX  FOR  EACH 
GROUP,  AND  TOTAL,  BETWEEN  AND  WITHIN  VARIANCE- 
COVARIANCE  MATRICES. 

BMDP2V  ANALYSIS  OF  VARIANCE  AND  COVARIANCE,  INCLUDING 

REPEATED  MEASURES 

PERFORMS  ANALYSES  OF  VARIANCE  OR  COVARIANCE  FOR 
GENERAL  FIXED  EFFECTS  AND  REPEATED  MEASURES 
MODELS.  FOR  EACH  SUBJECT  IN  REPEATED  MEASURES 
MODELS,  THE  TRIAL  FACTORS  (REPEATED  MEASURES  FAC- 
TORS, FOR  WHICH  EACH  SUBJECT  IS  MEASUREO  AT  EACH 
LEVEL)  MUST  HAVE  A COMPLETE  FACTORIAL  STRUCTURE 
WITH  NO  MISSING  OBSERVATIONS.  GROUP  INDICES  ARE 
READ  AS  OATA  AND  DETERMINE  THE  GROUP  MEMBERSHIPS 
WITHOUT  FURTHER  SPECIFICATION.  P2 V ALSO  HANDLES 
MODELS  THAT  CONTAIN  TRIAL  FACTORS  ONLY  OR  GROUPING 
FACTORS  ONLY.  W I TH I N-SUB JECT  RESPONSES  NEED  NOT 
BE  INDEPENDENT,  BUT  BETWEEN-SUB JECT  RESPONSES  ARE 
ASSUMED  TO  BE.  ALL  FACTORS,  EXCEPT  SUBJECTS,  ARE 
ASSUMED  FIXED.  AN  ORTHOGONAL  DECOMPOSITION  OF  THE 
TRIAL  EFFECTS  CAN  BE  REQUESTED.  COVARIATES  MAY  BE 
(BUT  NEED  NOT  BE)  CONSTANT  ACROSS  TRIALS.  UNEQUAL 
CELL  SIZES  ARE  ALLOWED.  SEVERAL  DISTINCT  HYPOTH- 
ESES CAN  BE  TESTED,  DEPENDING  ON  THE  PROBLEM.  THE 
HYPOTHESES  TESTED  IN  THIS  PROGRAM  ARE  GENERALLY 
APPROPRIATE  FOR  EXPERIMENTAL  DATA.  OUTPUT  IN- 
CLUDES AN  ANALYSIS  OF  VARIANCE  TABLE  CONSISTING  OF 
SUMS  OF  SQUARES,  DEGREES  OF  FREEDOM,  MEAN  SQUARES, 
AND  F STATISTICS  (WITH  PROBABILITY  VALUES  ASSOCI- 
ATED WITH  EACH),  CELL  MEANS  AND  STANDARD  DEVI- 
ATIONS FOR  THE  DEPENDENT  VARIABLE  AND  COVARIATES, 
ADJUSTED  CELL  MEANS  AND  REGRESSION  COEFFICIENTS 
(WHEN  COVARIATES  ARE  PRESENT),  AND  DESIGN 
INFORMATION. 
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BMDO  l V 


B MD02  V 


8 M D 0 3 V 


BMD09  V 


8M005 V 


BMD06 V 


ANALYSIS  OF  VARIANCE  OF  ONE-WAT  DESIGN 
COMPUTES  AN  ANALYSIS  OF  VARIANCE  TABLE  FOR  ONE 
VARIABLE  OF  CLASSIFICATION,  WITH  UNEQUAL  GROUP 
SAMPLE  SIZES.  PRINT-OUT  IS  SELF-EXPLANATORY.  THE 
PROGRAM  TAKES  UP  TO  5,000  GROUPS,  WITH  A TOTAL 
SAMPLE  SIZE  OF  100,000,000. 

ANALYSIS  OF  VARIANCE  FOR  FACTORIAL  DESIGN 
COMPUTES  ANALYSIS  OF  VARIANCE  FOR  A FACTORIAL  DE- 
SIGN. A BREAKDOWN  OF  THE  SUMS  OF  SQUARES  INTO  OR- 
THOGONAL POLYNOMIAL  COMPONENTS  FOR  AS  MANY  AS  FOUR 
MAIN  EFFECTS  AND  ALL  OF  THEIR  FIRST  ORDER  INTER- 
ACTIONS FOR  UP  TO  EIGHT  SPECIFIED  VARIABLES  ARE 
COMPUTED.  CELL  AND  MARGINAL  MEANS  ARE  LISTED, 
PRINT-OUT  IS  SELF-EXPLANATORY.  PROGRAM  WILL  TAKE 
UP  TO  EIGHT  FACTORS  WITH  999  SUBJECTS  PER  CELL, 
CELLS  NOT  TO  EXCEED  18,000  IN  NUMBER.  WILL  ALSO  DO 
TREND  ANALYSIS. 

ANALYSIS  OF  COVARIANCE  FOR  FACTORIAL  DESIGN 
COMPUTES  A FULL  FACTORIAL  ANALYSIS  OF  COVARIANCE 
FOR  A MAXIMUM  OF  SIX  FACTORIAL  COMPONENTS  AND  A 
MAXIMUM  OF  EIGHT  COVARIATES.  ADJUSTED  RESIDUALS 
AND  REGRESSION  COEFFICIENTS  ARE  COMPUTED.  THE  IN- 
VERSE OF  THE  COVARIANCE  MATRIX  OF  THE  COVARIATES 
IS  PRINTED. 

ANALYSIS  OF  COVARIANCE  WITH  MULTIPLE  COVARIATES 
COMPUTES  ANALYSIS  OF  COVARIANCE  INFORMATION  FOR 
ONE  ANALYSIS  OF  VARIANCE  VARIABLE  WITH  MULTIPLE 
COVARIATES  AND  UNEQUAL  TREATMENT  GROUP  SIZES. 

CASES  MAY  BE  SPECIFIED  BY  THE  USER  AS  BEING  IN 
CERTAIN  CATEGORIES,  OR  CASES  MAY  BE  PLACED  IN  CAT- 
EGORIES BY  THE  PROGRAM  IN  ACCORDANCE  WITH  A SPECI- 
FIED CRITERION.  WILL  TAKE  UP  TO  35  COVARUTES  IN 
99  GROUPS,  NONE  TO  EXCEED  999  SUBJECTS.  SEE  ALSO 
BMDX82  AND  BMDO 1 R » 

GENERAL  LINEAR  HYPOTHESIS 

PERFORMS  THE  CALCULATIONS  REQUIRED  FOR  A GENERAL 
LINEAR  HYPOTHESIS  MODEL.  CAN  BE  USED  FOR  BALANCED 
OR  UNBALANCED  ANALYSIS  OF  VARIANCE  OR  COVARIANCE 
DESIGNS  ANO  MISSING-VALUE  PROBLEMS.  THE  INDEPEN- 
DENT VARIABLES  ARE  OF  TWO  GENERAL  TYPES!  THE  VARI- 
ABLES SPECIFYING  THE  ANALYSIS  OF  VARIANCE  CLASSI- 
FICATIONS AND  THE  VARIABLES  WHICH  ARE  COVARIATES. 
SEE  ALSO  BMDX69  (OR  BMD10V). 

GENERAL  LINEAR  HYPOTHESIS  WITH  CONTRASTS 
SIMILAR  TO  BMD05V,  DESIGNED  TO  ESTIMATE  AND  TEST 
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THE  STATISTICAL  SIGNIFICANCE  OF  THE  PARAMETERS 
WHICH  OCCUR  IN  THE  GENERAL  LINEAR  HYPOTHESIS  MOD- 
EL. ALSO,  IT  CAN  TEST  THE  STATISTICAL  SIGNIFI- 
CANCE OF  ANY  REAL-VALUED  LINEAR  FUNCTION  OF  THE 
PARAMETERS.  HIGHER-ORDER  SIMULTANEOUS  TESTS  CANNOT 
BE  MADE.  ALSO  SEE  BMDX6*(  (OR  BMDlOV). 

BMDO  7 V MULTIPLE  RANGE  TESTS 

PERFORMS  MULTIPLE  RANGE  TESTS  ON  TWO  KINDS  OF  IN- 
PUT! (II  RAW  DATA  WITH  ONE  VARIABLE  OF  CLASSIFICA- 
TION AND  UNEQUAL  GROUP  SIZES  AND  (2)  SAMPLE  SIZES, 
GROUP  MEANS,  STANDARD  DEVIATIONS,  AND  GROUP  LA- 
BELS. TEST  IS  DONE  ACCORDING  TO  USER  CARD  SPECIFI- 
CATIONS! OTHERWISE  IT  WILL  BE  DONE  USING  DUNCAN'S 
MULTIPLE  RANGE  TEST  SUPPLIED  BY  THE  PROGRAM. 

BMD08V  ANALYSIS  OF  VARIANCE 

PERFORMS  ANALYSIS  OF  VARIANCE  FOR  ANY  HIERARCHICAL 
DESIGN  WITH  EQUAL  CELL  SIZES.  THIS  INCLUDES  NEST- 
ED, PARTIALLY  NESTED  AND  PARTIALLY  CROSSED,  AND 
FULLY  CROSSED  DESIGNS.  THE  MODEL  IS  SPECIFIED  BY 
INDICATING  THE  NESTING  RELATIONSHIPS  OF  THE  IN- 
DICES. SEPARATE  ANALYSES  MAY  BE  PERFORMED  SIMUL- 
TANEOUSLY ON  SEVERAL  DEPENDENT  VARIABLES.  EACH 
ANALYSIS  OF  VARIANCE  TABLE  INCLUDES  AN  EXPECTED 
MEAN  SQUARE  IN  TERMS  OF  THE  POPULATION  VARIANCE 
COMPONENTS.  VERY  VERSATILE  AS  COMPARED  WITH  BMDOlV 
AND  BMD02V. 

BMDX63  MULTIVARIATE  GENERAL  LINEAR  HYPOTHESIS 

(UR  BMD1IV)  PERFORMS  MULTIPLE  REGRESSION  ON  A DEPENDENT  VARI- 
ABLE IN  THE  FORM  OF  A VECTOR.  COMPUTES  LIKELIHOOD 
RATIO  STATISTICS  AND  APPROXIMATE  F-STATISTIC  TO 
TEST  MULTIVARIATE  HYPOTHESES  OF  THE  FORM 
A B C'  = D WHERE  B IS  A MATRIX  OF  regression  CO- 
EFFICIENTS AND  A,  C,  AND  D ARE  MATRICES  SPECIFIED 
BY  THE  USER.  WITH  PROPER  SPECIFICATION,  THE  PRO- 
GRAM CAN  BE  USED  TO  CARRY  OUT  BALANCED  OR  UNBAL- 
ANCED MULTIVARIATE  ANALYSES  OF  VARIANCE  AND  CO- 
VARIANCE. 

BMDX6H  GENERAL  LINEAR  HYPOTHESIS 

(OR  BMDIOv I ESTIMATES  PARAMETERS  AND  TESTS  HYPOTHESES  CONCERN- 
ING A GENERAL  LINEAR  MODEL.  DESIGN  VARIABLES  FOR 
THE  ANALYSIS  OF  VARIANCE  ARE  GENERATED  BY  THE  PRO- 
GRAM. MODEL  1 ANALYSES  OF  VARIANCE  AND  COVARIANCE 
TESTS  ARE  PERFORMED  AUTOMATICALLY  ALONG  WITH  TESTS 
OF  USER-SPECIFIED  LINEAR  HYPOTHESES.  THE  PROGRAM 
DOES  NOT  REQUIRE  BALANCED  OR  EVEN  FULL  RANK  MOD- 
E 
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BMDX69  MULTIVARIATE  ANALYSIS  OF  VARIANCE  AND  COVARIANCE 

(OR  BMD12V)  PERFORMS  MODEL  I UNIVARIATE  OR  MULTIVARIATE  ANAL- 
YSIS OF  VARIANCE  OR  COVARIANCE  FOR  ANY  COMPLETE 
BALANCED  DESIGN  WITH  EQUAL  CELL  SIZES.  THE  DESIGN 
IS  SPECIFIED  BY  INDICATING  NESTING  RELATIONS  FOR 
THE  CLASSIFICATION  INDICES. 

BMDX82  ANALYSIS  OF  COVARIANCE 

(OR  BMD09 v ) PERFORMS  A ONE-WAY  ANALYSIS  OF  COVARIANCE,  USING 

ONE  OR  MORE  COVARIATES.  GROUP  SIZES  MAY  BE  UNEQUAL 
AND  PARALLEL  ANALYSES  MAY  BE  PERFORMED  USING  SEV- 
ERAL DEPENDENT  VARIABLES.  THE  PRQ6R AM  TESTS  FOR 
EQUALITY  OF  SLOPE  FROM  GROUP  TO  GROUP  AND  COMPUTES 
T-STATISTICS  FOR  CONTRASTS  OF  ADJUSTED  GROUP 
MEANS,  INCLUDING,  IN  PARTICULAR,  ALL  PAIRWISE  DIF- 
FERENCES. 


72 


MATHEMATICAL  and  statistical  software  index 


M.1.3.  documentation  references 


DIXON,  W.J.  (ED.).  8 MD  BIOMEDICAL  COMPUTER  PROGRAMS. 

BERKELEY,  CALIFORNIA!  UNIVERSITY  OF  CALIFORNIA  PRESS, 
1968. 


DIXON,  W.J.  (ED.).  BMD  BIOMEDICAL  COMPUTER  PROGRAMS  X-SER1ES 
SUPPLEMENT.  BERKELEY,  CALIFORNIA!  UNIVERSITY  OF  CALIFOR- 
NIA PRESS,  1970. 

DIXON,  W.J.  (ED.).  BMD  BIOMEDICAL  COMPUTER  PROGRAMS. 

BERKELEY,  CALIFORNIA!  UNIVERSITY  OF  CALIFORNIA  PRESS, 

1 973. 

DIXON,  W.J.,  b BROWN,  M.B.  (EDITORS).  BMDP-77  BIOMEDICAL 
COMPUTER  PROGRAMS,  P-SERIES.  BERKELEY,  CALIFORNIA! 
UNIVERSITY  OF  CALIFORNIA  PRESS,  1977. 


73 


MATHEMATICAL  and  statistical  software  index 


4.2.  STATJOB 


4.2.1.  general  description 


STATJOB  is  a system  of  computer  programs  with  general-purpose 
STATISTICAL  analysis  capabilities  developed  at  the  university  of 

W I SCONS  I N-mAD I SON  ACADEMIC  COMPUTING  CENTER  (MACCI  FOR  USE  ON 
UNIVAC  IIOO  SERIES  COMPUTERS.  IN  ADDITION  TO  ITS  USE  AT  THE 
UNIVERSITY  OF  WISCONSIN,  STATJOB  IS  USED  AT  OVER  FORTY  OTHER  SITES 
HAVING  UNUAC  1100  SERIES  COMPUTERS,  INCLUDING  AFHRL.  DEVELOPMENT 
OF  STATJOB  BEGAN  AT  MACC  IN  1966,  AND  UNTIL  MARCH  1 974  , STATJOB 

WAS  THE  Only  INTEGRATED  PACKAGE  of  statistical  programs  available 
AT  THE  UNIVERSITY  OF  WISCONSIN.  THE  WIDESPREAD  USE  OF  STATJOB 
(E.G.,  BY  USERS  IN  BIOLOGICAL  SCIENCE,  SOCIAL  SCIENCE,  ENGINEERING 
AND  PHYSICAL  SCIENCE,  BUSINESS  AND  ECONOMICS.  AND  GENERAL 
STATISTICS)  OVER  AN  EXTENDED  PERIOD  OF  TIME  AT  THE  UNIVERSITY  OF 
WISCONSIN  RESULTED  IN  A SYSTEM  WHICH  PROVIDES  OVERALL  GOOD 
PERFORMANCE.  PLANS  FOR  CONTINUED  EXPANSION  OF  THE  SYSTEM  WERE 
DROPPED  BY  STATJOB  DEVELOPERS,  HOWEVER,  AFTER  RELEASE  OF  VERSION 
10  IN  APRIL  1976,  AND  PRESENT  PROGRAMMING  SUPPORT  FOR  STATJOB  AT 
MACC  CONSISTS  OF  MA I NTE NANCE  ONLY. 

IN  VERSION  10  OF  STATJOB  (WHICH  IS  AVAILABLE  IN  THE  AFHRL 
SOFTWARE  LIBRARY),  THE  VARIOUS  PROCEDURES  AVAILABLE  FOR  THE 
ANALYSIS  OF  DATA  ARE  DIVIDED  AMONG  13  PROGRAMS.  THE  FOLLOWING  LIST 
GIVES  THE  NAMES  OF  THESE  PROGRAMS  AND  THE  METHODOLOGY  PROVIDED  BY 
E ACH  : 

COLFREQI  - SINGLE  COLUMN  FREQUENCY  COUNTS; 

CROsT  AB2  - DATA  TABULATION! 

DISCRIM1  - DISCRIMINANT  analysis; 

D S T a T 2 - DESCRIPTIVE  STATISTICS  AND  CORRELATIONS 

F A C TOR  3 - FACTOR  ANALYSIS! 

NWAYI  - GENERAL  ANALYSIS  OF  VARIANCE; 

0NEWAY2  - ONE-WAY  ANALYSIS  OF  VARIANCE! 

P1CTI  - DATA  GRAPHS  AND  TABLES? 

REGAN2  - MULTIPLE  LINEAR  REGRESSION! 

ROTATEl  - FACTOR  ROTATION! 

STEPREG I - STEPWISE  LINEAR  REGRESSION! 

TRANS1  - DATA  TRANSFER,  INCLUDING  TRANSFORMATION, 

RECODING,  REFORMATTING,  AND  SORTING  OF  DATA! 

UN  1 s T A T 2 - UNIVARIATE  STATISTICS  AND  HISTOGRAMS. 

DESCRIPTIONS  AND  PROCEDURES  FOR  THE  USE  OF  THESE  PROGRAMS  ARE 
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PROVIDED  IN  a SERIES  OF  REFERENCE  MANUALS.  THE  SERIES  INCLUDES  A 
MANUAL  FOR  EACH  ANALYSIS  PROGRAM  AND  MANUALS  DESCRIBING  THE 
STATJOB  FEATURES  FOR  DATA  HANDLING.  IN  ADDITION  TO  PROVIDING 
INFORMATION  ON  THE  USE  OF  THE  STATJOB  PROGRAMS,  THE  STATJOB 
PROGRAM  manuals  CONTAIN  COMPLETE  DESCRIPTIONS  OF  COMPUTATIONAL 
METHODS  AND  STATISTICAL  FORMULAS.  THE  FOLLOWING  SECTION  CONTAINS 
INFORMATION  EXTRACTED  FROM  THE  13  STATJOB  PROGRAM  MANUALS  WHICH 
BRIEFLY  DESCRIBES  THE  MAJOR  FEATURES  OF  EACH  PROGRAM. 
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<*.2.2.  LIST  OF  PROGRAMS  AND  PROGRAM  DESCRIPTIONS 


COLFREQ 1 SINGLE  COLUMN  FREQUENCY  COUNTS 

PERFORMS  SINGLE  COLUMN  FREQUENCY  COUNTS  OF  LETTERS, 
NUMERALS,  AND  OTHER  CHARACTERS  FOR  SPECIFIED  COLUMNS 
OF  INPUT  RECORDS  UP  TO  2000  CHARACTERS  IN  LENGTH.  IN 
ADDITION  TO  A TABLE  OF  FREQUENCIES  FOR  THE  CHARACTERS 
APPEARING  IN  SPECIFIED  COLUMNS,  STANDARD  OUTPUT  ALSO 
INCLUDES  A SUMMARY,  BY  COLUMN,  OF  THE  FREQUENCY 
COUNTS  AND  PERCENTAGES  FOR  FOUR  GROUPS  OF  CHARACTERS: 
NUMERALS,  LETTERS,  BLANKS,  AND  SPECIAL  CHARACTERS. 
ALSO  INCLUDED  ARE  THE  NUMERIC  MEAN  OF  EACH  COLUMN  AND 
THE  ADJUSTED  MEAN,  TREATING  BLANKS  AS  ZEROES.  AS 
OPTIONAL  OUTPUT,  A TABLE  OF  PERCENTAGES  BY  COLUMN  IS 
AVAILABLE  GIVING  THE  PERCENT  OF  FREQUENCY  OF  EACH 
CHARACTER. 

CR0STAB2  DATA  TABULATION 

PRODUCES  MULTIDIMENSIONAL  C R 0 S S - T A B U L A T I 0 N S WITH  NO 
RESTRICTIONS  ON  THE  NUMBER  OF  DIMENSIONS.  IN  EACH 
TABULATION,  ONE  OR  MORE  VARIABLES  MAY  BE  TREATED  AS 
CLASSIFICATION  VARIABLES  WHILE  OTHER  VARIABLES  ARE 
USED  AS  ACCUMULATION  VARIABLES  TO  OBTAIN  CELL  MEANS, 
STANDARD  DEVIATIONS  AND  OTHER  STATISTICS,  IN  AODITION 
TO  FREQUENCY  COUNTS  AND  PERCENTS.  EITHER  NUMERIC 
(CONTINUOUS)  VARIABLES  OR  ALPHANUMERIC  (DISCRETE) 
VARIABLES,  OR  BOTH,  CAN  BE  USED  AS  CLASSIFICATION 
VARIABLES.  THERE  ARE  FACILITIES  FOR  SPECIFICATION  OF 
CATEGORIES  OF  VALUES,  AND  LABELS  AND  WEIGHTS  ASSOCI- 
ATED WITH  THEM.  DISTINCTION  CAN  BE  MADE  FOR  CLASSIFI- 
CATION VARIABLES  OF  PRIMARY  AND  SECONDARY  INTEREST. 
MARGINAL  TOTALS,  PERCENTAGES,  AND  BIVARIATE  STATIS- 
TICS ARE  GIVEN  BASED  ON  PRIMARY  CATEGORIES  ONLY.  CHI- 
SQUARE,  TAU,  AND  GAMMA  BIVARIATE  STATISTICS  CAN  BE 
COMPUTED  OR  A U S E R - S U P P L I E D ROUTINE  CAN  COMPUTE  BI- 
VARIATE STATISTICS.  SPECIAL  PROCEDURES  ARE  INCLUDED 
FOR  HANDLING  MISSING  DATA. 

DISCRlMj  DISCRIMINANT  ANALYSIS 

PERFORMS  DISCRIMINANT  ANALYSIS  ON  TWO  OR  MORE  GROUPS 
OF  DATA  USING  A SET  OF  PREDICTOR  OR  DISCRIMINATING 
VARIABLES.  CLASSICAL  AND  STEPWISE  VARIABLE  SELECTION 
PROCEDURES,  OR  A COMBINATION  OF  THESE  PROCEDURES,  ARE 
PROVIDED  WITH  RAO'S  V STATISTIC  BEING  USED  AS  THE 
CRITERION  FOR  ENTERING  A VARIABLE  OR  A SET  OF  VARI- 
ABLES INTO  THE  ANALYSIS.  OUTPUT  INCLUDES  DESCRIPTIVE 
STATISTICS  BY  GROUP  AND  FOR  THE  TOTAL  SAMPLE!  BIVAR- 
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I A T E STATISTICS  (MOMENTS,  COVARIANCES,  CORRELATIONS) 
FOR  EACH  GROUP,  WITHIN  GROUPS,  AMONG  GROUPS,  AND  FOR 
THE  TOTAL  -.AMPLE?  AND  MAHALANLBIS'  D-SQUARE  STATISTIC 
FOR  PAIRS  OF  GROUPS.  A UNIVARIATE  ANALYSIS  OF  VARI- 
ANCE ON  EACH  VARIABLE  MAY  ALSO  BE  PROVIDED,  AS  WELL 
AS  A TEST  OF  THE  EQUALITY  OF  GROUP  COVARIANCE  MA- 
TRICES. FOR  EACH  STEP  IN  A 5TEPW1SE  ANALYSIS,  RAO'S 
V STATISTIC  AND  SIGNIFICANCE  LEVEL  ARE  PROVIDED,  WHILE 
OUTPUT  FOR  THE  FINAL  SOLUTION  OF  ANALYSIS  INCLUDES 
WILKS'  LAMBDA  TEST  OF  THE  EQUALITY  OF  GROUP  MEANS • 
EIGENVALUES  AND  EIGENVECTORS  AND  OTHER  RESULTS  ASSO- 
CIATED WITH  THE  DISCRIMINANT  FUNCTIONS;  AND  THE  RE- 
DUCED SPACE  CENTROIDS,  DISPERSION  MATRICES,  AND 
ASSOCIATED  RESULTS. 

DESCRIPTIVE  STATISTICS  AND  CORRELATION 

COMPUTES  A CORRELATION  MATRIX  AND  A SET  OF  DESCRIPTIVE 

statistics  for  a data  set  consisting  of  observations 

ON  A SET  OF  VARIABLES.  RAW  INPUT  DATA  MAY  CONTAIN 
MISSING  DATA.  OPTIONALLY  PROVIDES  FISHER'S  Z-TRANS- 
FORMATION  OF  THE  CORRELATION  COEFFICIENTS  AND  CORRE- 
SPONDING SIGNIFICANCE  LEVELS  I T-TR ANSFORMAT  I ONS  OF 
THE  CORRELATIONS  AND  CORRESPONDING  SIGNIFICANCE  LEV- 
ELS,* THE  COVARIANCE  MATRIX;  AND  THE  RAW  CROSS-PROD- 
UCTS  MATRIX.  FOR  MISSING  DATA,  THE  CORRELATION  MATRIX 
IS  COMPUTED  USING  THE  BIVARIATE  SUBSAMPLE  METHOD,  OR 
optionally,  THE  REPLACEMENT  with  means  method. 

factor  analysis 

COMPUTES  A FACTOR  LOADING  OR  PATTERN  MATRIX  EITHER  BY 
FACTORING  THE  COVARIANCE  MATRIX  (FOR  A SET  OF  INPUT 
VARIABLES)  WITH  THE  PRINCIPAL  COMPONENTS  PROCEDURE  OR 
BY  FACTORING  THE  CORRELATION  MATRIX  OF  THE  INPUT 
VARIABLES  USING  ONE  OF  FIVE  PROCEDURES*.  PRINCIPAL 

components;  principal  factor;  uniqueness  rescaling 

OR,  OPTIONALLY,  RAO'S  CANONICAL  FACTOR  PROCEDURE;  AL- 
PHA*, AND  IMAGE.  COMMUNAL1TY  ESTIMATES  MAY  BE  PROVIDED 
OR  THEY  MAY  BE  DETERMINED  FROM  THE  DATA  AS  SQUARED 
MULTIPLE  CORRELATIONS.  VARIMAX  OR  QUARTlMAX  ROTATIONS 
ARE  OPTIONALLY  AVAILABLE.  FACTOR  SCORES  MAY  BE  RE- 
QUESTED FOR  THE  PRINCIPAL  COMPONENTS  AND  IMAGE  PROCE- 
DURES. STANDARD  OUTPUT  INCLUDES  CORRELATIONS,  DE- 
SCRIPTIVE STATISTICS,  COMMUNAL  I T I ES  , EIGENVALUES, 
FACTOR  LOADINGS,  AND  ANALYSIS  OF  FACTOR  VARIANCE. 

GENERAL  ANALYSIS  OF  VARIANCE 

PERFORMS  AN  ANALYSIS  OF  VARIANCE  FOR  ANY  COMPLETE  FAC- 
TORIAL DESIGN  WITH  UP  TO  12  FACTORS.  EACH  PAIR  OF  FAC- 
TORS MUST  BE  EITHER  COMPLETELY  CROSSED  OR  COMPLETELY 
NESTED.  IF  THE  CELLS  OF  THE  DESIGN  DO  NOT  HAVE  THE 
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SAME  NUMBER  OF  REPLICATES,  AND  IF  REPLICATES  ARE  NOT 
USED  AS  A FACTOR,  THEN  AN  UNWEIGHTED  MEANS  ANALYSIS  IS 
PERFORMED  (EXCEPT  FOR  ONE-WAY  ANALYSIS  WHICH  IS  EX- 
ACT), IF  REPLICATES  ARE  USED  AS  A FACTOR,  THEN  HENDER- 
SON * S METHOD  1 IS  USED.  STANDARD  OUTPUT  FROM  NWAYl 
CONSISTS  OF  AN  ANALYSIS  OF  VARIANCE  TABLE  CONTAINING, 
FOR  EACH  SOURCE  OF  VARIATION  IN  THE  DESIGN,  THE  SUM  OF 
SQUARE5,  DEGREES  OF  FREEDOM,  MEAN  SQUARE,  F-RATIO  AND 
SIGNIFICANCE  LEVEL,  AND  ESTIMATE  OF  VARIANCE  COMPO- 
NENT. OPTIONAL  OUTPUT  INCLUDES  TABLES  OF  CELL  COUNTS, 
MEANS  AND  STANDARD  DEVIATIONS  OR  VARIANCES!  TABLES  OF 
EXPECTED  MEAN  SQUARES;  AND  TABLES  OF  DENOMINATOR  MEAN 
SQUARES  AND  DEGREES  OF  FREEDOM. 

ONE-WAY  ANALYSIS  OF  VARIANCE 

PERFORMS  ONE-WAY  ANALYSIS  OF  VARIANCE  AND  PERMITS  COM- 
PARISONS OF  CELL  MEANS  BY  T-TESTS,  LINEAR  CONTRASTS, 
ORTHOGONAL  POLYNOMIAL  CONTRASTS,  MULTIPLE  RANGE  TESTS, 
AND  SIMPLE-EFFECTS  ANALYSIS  OF  VARIANCE.  STANDARD  OUT- 
PUT CONSISTS  OF  SEVERAL  DESCRIPTIVE  STATISTICS  FOR 

each  group  and  the  analysis  of  variance  table,  option- 
al OUTPUT  INCLUDES  RESIDUALS,  HISTOGRAMS  FOR  EACH 
GROUP,  ETA  SQUARED  AND  OMEGA  SQUARED,  ESTIMATE  OF 
GROUP  VARIANCES  FOR  THE  RANDOM  EFFECTS  MODEL,  AND 
OUTPUT  FOR  THE  VARIOUS  TYPES  OF  CELL  MEAN  COMPARISONS 
AVAILABLE.  OPTIONAL  OUTPUT  IS  ALSO  GIVEN  FOR  THE  FOL- 
LOWING TESTS  OF  HOMOGENEITY  OF  VARIANCE!  BARTLETT’S 
test;  cochran’s  test;  f-max  test;  bartlett-kendall-box 
test;  box-anderson  test;  levene  test;  and  jackknife 
test,  paired  t-tests  or  one  sample  t-tests  can  be  per- 
formed by  calculating  difference  scores  and  examining 

THE  TEST  FOR  THE  MEAN  IN  THE  ANALYSIS  OP  VARIANCE  TA- 
BLE. UP  TO  1000  VARIABLES  CAN  BE  ANALYZED  IN  ONE  COM- 
PUTER RUN  WITH  THE  MAXIMUM  NUMBER  OF  GROUPS  ALLOWED 
RANGING  FROM  BOO  TO  1000  DEPENDING  WHICH  OPTIONAL  OUT- 
PUTS ARE  REQUESTED. 

SCATTERPLOTS , GRAPHS,  AND  TABLES 

PROVIDES  BIVARIATE  SCATTERPLOTS,  GRAPHS  AND  TABLES  OF 
THE  VALUES  OF  SELECTED  VARIABLES  FROM  A DATA  SET. 
SCATTERPLOTS  CONSIST  OF  PLOTS  OF  THE  DATA  VALUES  OF 
PAIRS  OF  INPUT  VARIABLES  ON  A TWO-DIMENSIONAL  COORDI- 
NATE SYSTEM.  FACILITIES  ARE  PROVIDED  TO  DENOTE  UP  TO 
THIRTY-FIVE  OCCURRENCES  OF  A PARTICULAR  COORDINATE 
POINT  ANO  TO  CLASSIFY  DATA  POINTS  INTO  GROUPS  ON  THE 
BASIS  OF  A THIRD  VARIABLE  WITH  A UNIQUE  SYMBOL  FOR 
EACH  GROUP.  OPTIONAL  OUTPUT  WITH  THE  SCATTERPLOTS  IN- 
CLUDES REGRESSION  STATISTICS  AND  INDICATORS  OF  THE 
REGRESSION  LINE  ON  THE  PLOT  AND  A VARIETY  OF  GRID  SYS- 
TEMS. GRAPHS  CONSIST  OF  VALUES  FOR  UP  TO  EIGHT  DEPEN- 
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DENT  VARIABLES  PLOTTED  ON  THE  SAME  GRAPH  USING  THE 
SAME  SCALE  FOR  ALL  VARIABLES  ALONG  THE  VERTICAL  AXIS. 
THE  HORIZONTAL  AXIS  REPRESENTS  THE  ORDER  OF  OBSERVA- 
TIONS OF  THE  DATA  SET  OR  THE  VALUES,  SORTED  IN  AS- 
CENDING ORDER,  OF  A SPECIFIED  VARIABLE  IN  THE  DATA 
SET.  WHEN  MORE  THAN  ONE  VARIABLE  IS  INCLUDED  IN  THE 
VERTICAL  DIRECTION  OF  THE  GRAPH,  THE  CUMULATIVE  SUMS 
OF  SUCCESSIVE  VARIABLES  CAN  BE  PLOTTED  INSTEAD  OF  THE 
VALUES  THEMSELVES.  TABLES  PRODUCED  WITH  PlCTl  ARE  PRO- 
DUCED BY  LISTING  THE  VALUES  OR  CUMULATIVE  SUMS  OF  UP 
TO  EIGHT  VARIABLES  AGAINST  THE  ORDER  OF  THE  OBSERVA- 
TIONS WITHIN  THE  DATA  SET  OR  ACCORDING  TO  THE  ASCEND- 
ING ORDER  OF  VALUES  OF  A SPECIFIED  VARIABLE  FROM  THE 
DATA  SET. 

MULTIPLE  LINEAR  REGRESSION 

PERFORMS  MULTIPLE  LINEAR  REGRESSION  ANALYSIS  WITH  A 
CAPABILITY  OF  SOLVING  MANY  MODELS  IN  ONE  PASS  OF  THE 
DATA.  INDEPENDENT  VARIABLES  MAY  ENTER  THE  REGRESSION 
EQUATION  IN  GROUPS.  STANDARD  OUTPUT  INCLUDES  THE  COR- 
RELATION MATRIX  AND  DESCRIPTIVE  STATISTICS  FOR  ALL 
INPUT  VARIABLES.  AS  EACH  GROUP  OF  VARIABLES  ENTERS 
THE  EQUATION,  THE  FOLLOWING  OUTPUT  MAY  BE  OBTAINED! 
ESTIMATE  OF  STANDARD  ERROR  OF  THE  REGRESSION;  MULTI- 
PLE CORRELATION  COEFFICIENT  R,  R-SQUaRE,  AND  CORRECT- 
ED r-square;  ordinary  and  standardized  regression  co- 
efficients; STANDARD  ERRORS  OF  THE  REGRESSION  COEFFI- 
CIENTS’. PARTIAL  CORRELATION  COEFFICIENTS!  PARTIAL  F 
VALUE,  T-VALUE,  AND  SIGNIFICANCE  LEVEL  FOR  EACH  VARI- 
ABLES PARTIAL  F VALUE  AND  SIGNIFICANCE  LEVEL  FOR 
EACH  GROUP  OF  VARIABLES  ENTERING!  ANOVa  TABLE!  COVAR- 
IANCE OR  correlation  matrix  of  regression  coeffi- 
cients; RESIDUALS  AND  PREDICTED  VALUES,  WITH  A PLOT 
OF  STANDARDIZED  RESIDUALS. 

FACTOR  ROTATION 

PERFORMS  FACTOR  ANALYTIC  ROTATIONS  OF  AN  INPUT  MATRIX 
OF  FACTOR  LOADINGS,  USUALLY  FROM  AN  UNROTATED  FACTOR 
PATTERN  MATRIX.  HOWEVER,  A ROTATED  PATTERN  MATRIX  MAY 
ALSO  BE  INPUT.  THE  USER  MAY  SELECT  COLUMNS  FROM  THE 
LOADING  MATRIX  FOR  ROTATION.  MORE  THAN  ONE  TYPE  OF 
ROTATION  CAN  BE  MADE,  AND  MORE  THAN  ONE  SET  OF  FAC- 
TORS CAN  BE  ROTATED  IN  A SINGLE  RUN.  BOTH  ORTHOGONAL 
AND  OBLIQUE  ROTATIONS  ARE  AVAILABLE  WITH  THE  ORTHO- 
GONAL ROTATIONS  FROM  THE  ORTHOMAX  FAMILY,  TWO  CASES 
OF  WHICH  ARE  VAR1MAX  AND  QUARTIMAX  ROTATION.  TWO 
OBLIQUE  ROTATIONS  MAY  BE  SELECTED  FROM  THE  HARRIS  AND 
KAISER  FAMILY  OF  -ORTHOBL  I QUE"  ROTATIONS!  INDEPENDENT 
CLUSTER  ROTATION  (TYPE  1)  AND  P*P  PROPORTIONAL  TO 
FACTOR  CORRELATION  MATRIX  (TYPE  2).  STANDARD  OUTPUT 


MATHEMATICAL  and  statistical  software  index 


STEPRE6 1 


TRANS  l 


UNISTAT2 


INCLUDES  RAW  FACTOR  MATRIX!  ROTATED  FACTOR  PATTERN 
MATRIX!  VARIANCE  ANALYSIS  FOR  ROTATED  FACTORS!  FACTOR 
STRUCTURE  MATRIX  (OBLIQUE  ROTATIONS  ONLY)!  AND  COR- 
RELATION MATRIX  FOR  OBLIQUELY  ROTATED  FACTOR  SCORES* 

STEPWISE  LINEAR  REGRESSION  ANALYSIS 

PERFORMS  MULTIPLE  LINEAR  REGRESSION  IN  A STEPWISE 
MANNER  WHICH  IS  AN  EXPANSION  OF  THE  PROCEDURE  DE- 
SCRIBED BY  EFROYMSON  (PAGES  191-203)  IN  "MATHEMATICAL 
METHODS  FOR  DIGITAL  COMPUTERS,"  BY  R.  RALSTON  AND 
H.  WILF  (EDITORS).  ONE  SET  OF  INDEPENDENT  VARIABLES 
(BASE  VARIABLES)  CAN  BE  FORCED  INTO  THE  REGRESSION 
EQUATION  IN  THE  ORDER  SPECIFIED  ON  A CONTROL  CARD! 
THEY  MAY  ENTER  THE  EQUATION  EITHER  INDIVIDUALLY,  OR 
IN  GROUPS.  INDEPENDENT  VARIABLES  FROM  ANOTHER  SET 
(FREE  VARIABLES)  CAN  BE  SELECTED  FOR  ENTRY  OR  REMOVAL 
FROM  THE  EQUATION  WITH  EITHER  FORWARD  OR  BACKWARD 
STEPWISE  PROCEDURES.  EACH  PROCEDURE  HAS  OPTIONS  TO 
SPECIFY  SIGNIFICANCE  CRITERIA  FOR  ENTRY  OR  REMOVAL  OF 
INDEPENDENT  VARIABLES.  THE  CORRELATION  MATRIX  AND 
DESCRIPTIVE  STATISTICS  ARE  PRINTED  FOR  ALL  INPUT 
VARIABLES.  OUTPUT  MAY  BE  OBTAINED  AT  EACH  STEP,  OR  AT 
THE  FINAL  STEP  IN  THE  SOLUTION  PROCESS,  AND  IS  IDEN- 
TICAL WITH  THE  OUTPUT  THAT  MAY  BE  OBTAINED  WITH 
REGAN2.  AFTER  COMPLETION  OF  THE  ANALYSIS,  A SUMMARY 
OF  STEPS  IS  GIVEN,  FOLLOWED  BY  A LIST  OF  STATISTICS 
RELATED  TO  VARIABLES  NOT  IN  THE  FINAL  EQUATION. 

DATA  TRANSFER 

TRANSI  CAN  BE  USED  TO  SORT  DATA  SETS  ON  ONE  OR  MORE 
VARIABLES  IN  ASCENDING  OR  DESCENDING  ORDER.  ONE  OR 
MORE  VARIABLES  MAY  BE  RANK  ORDERED,  PERCENTILE 
RANKED,  OR  STANDARDIZED*  TRANSI  CAN  BE  USED  TO  STRAT- 
IFY A DATA  SET  INTO  SEVERAL  DATA  SETS  ON  THE  VALUE 
OF  AN  INPUT  VARIABLE.  TRANSI  PERFORMS  TWO  OTHER  FUNC- 
TIONS OF  A ROUTINE  DATA  PROCESSING  NATURE!  TRANSFER 
OF  DATA,  WITH  OR  WITHOUT  TRANSFORMATION,  FROM  ONE 
DATA  MODE  AND/OR  MEDIA  TO  ANOTHER!  AND  SERVES  AS  THE 
INPUT  ROUTINE,  WITH  OR  WITHOUT  TRANSFORMATION,  TO  A 
USER-SUPPLIED  ANALYSIS  ROUTINE. 

UNIVARIATE  STATISTICS  AND  HISTOGRAMS 

PRINTS  UNIVARIATE  DESCRIPTIVE  STATISTICS  AND  HISTO- 
GRAMS FOR  NUMERIC  VARIABLES,  AND  HISTOGRAMS  FOR  AL- 
PHANUMERIC VARIABLES.  ONE  OR  MORE  SUBSETS  OF  VARI- 
ABLES CAN  BE  SPECIFIED  FOR  ANALYSIS.  A WEIGHT  VARI- 
ABLE CAN  BE  SPECIFIED  FOR  ALL  COMPUTATIONS,  OR  A 
SEPARATE  WEIGHT  VARIABLE  CAN  BE  SPECIFIED  WITH  EACH 
SUBSET.  THE  DESCRIPTIVE  STATISTICS  INCLUDE  MEAN, 
VARIANCE,  STANDARD  DEVIATION,  MINIMUM  AND  MAXIMUM 
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VALUES,  STANDARD  ERROR  OF  THE  MEAN,  COEFFICIENT  OF 
VARIATION,  STUDENTIZEO  MEAN,  MEAN  ABSOLUTE  DEVIATION, 
AND  COEFFICIENTS  OF  SKEWNESS  AND  KURTOSIS!  CONFIDENCE 
INTERVALS  ON  THE  MEAN  AND  VARIANCE!  SELECTED  PERCEN- 
TILES, MEDIAN,  AND  TRI-MEAN!  TRIMMED  AND  WINSORIZED 

mean;  autocorrelations  up  to  any  order;  and  informa- 
tion ON  RUNS  UP  AND  DOWN,  AND  RUNS  ABOVE  AND  BELOW 
THE  MEDIAN.  FOR  NUMERIC  HISTOGRAMS,  EXPECTED  AS  WELL 
AS  OBSERVED  FREQUENCIES  CAN  BE  PLOTTED  FOR  THE  NORMAL 
OR  POISSON  PROBABILITY  MODELS.  WHEN  EXPECTED  FREQUEN- 
CIES ARE  PLOTTED,  PEARSON’S  CHI-SQUARE  GOODNESS-OF- 
F 1 T TEST  IS  PERFORMED. 
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4.2.3.  DOCUMENTATION  REFERENCES 


STATJOB  SERIES  OF  REFERENCE  MANUALS  FOR  UNIVAC  1100  SERIES 

COMPUTERS,  ACADEMIC  COMPUTING  CENTER,  UNIVERSITY  OF  WIS- 
CONSIN, MADISON,  WISCONSIN. 

THE  SERIES  INCLUDES  THE  FOLLOWING  MANUALS: 

INTRODUCTION  TO  STATJOB,  VERSION  10 
STATJOB  SUMMARY,  VERSION  10 

C0LPREQ1,  SINGLE  COLUMN  FREQUENCY  COUNTS  , . DECEMBER  , 1971 

CROST  A32 , DATA  TABULATION,  MARCH,  1975 
DISCRIM1,  DISCRIMINANT  ANALYSIS,  MARCH,  1975 
DSTAT2,  DESCRIPTIVE  STATISTICS  AND  CORRELATION,  SEPTEMBER, 
1 973 

PACT0R3,  FACTOR  ANALYSIS,  MAY,  1974 
NWAY1  FOR  PSYCHOLOGISTS,  MAY,  1971 

ONE  W A Y 2 , ONE-WAY  ANALYSIS  OF  VARIANCE,  NOVEMBER,  1975 
P1CT1,  SCATTERPLOTS , GRAPHS,  AND  TABLES,  SECOND  REVISION, 
APRIL,  1976 

REGAN2,  MULTIPLE  LINEAR  REGRESSION  ANALYSIS,  APRIL,  1971 
ROTATE!,  FACTOR  ROTATION,  NOVEMBER,  1974 
5TEPREG1,  STEPWISE  REGRESSION  ANALYSIS,  AUGUST,  1973 
TRANSl,  DATA  TRANSFER,  FIRST  REVISION,  DECEMBER.  1974 
UN  I S T A T 2 , UNIVARIATE  DESCRIPTIVE  STATISTICS  AND  HISTO- 
GRAMS, APRIL,  1976 
BASIC  SYSTEM  PROCEDURES,  MAY,  1970 
EXTENDED  SYSTEM  PROCEDURES,  FEBRUARY,  1972 
STJBANK  PROCEDURES,  OCTOBER,  1974 
TRANSFORMATION  PROCEDURES,  OCTOBER,  1975. 
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